1 Introduction

This article, which is a companion paper to [BLS09], consists of two parts. In Section 2
we present a version of Fenchel's perturbation method for the duality theory of the
Monge-Kantorovich problem of optimal transport. The treatment is elementary, as we suppose
that the spaces $(X, \mu)$, $(Y, \nu)$ on which the optimal transport problem [Vil03, Vil09]
is defined simply equal the finite set $\{1, \dots, N\}$ equipped with the uniform measure.
In this setting the optimal transport problem reduces to a finite-dimensional linear
programming problem.

The purpose of this first part of the paper is rather didactic: it should stress some
features of the linear programming nature of the optimal transport problem which carry
over to the case of general Polish spaces $X, Y$ equipped with Borel probability measures
$\mu, \nu$, and general Borel measurable cost functions $c : X \times Y \to [0, \infty]$.
This general setting is analyzed in detail in [BLS09]; Section 2 below may serve as a
motivation for the arguments in the proof of Theorems 1.2 and 1.7 of [BLS09], which pertain
to the general duality theory.

The second and longer part of the paper, consisting of Sections 3 and 4, is of a quite
different nature.

Section 3 is devoted to illustrating a technical feature of [BLS09, Theorem 4.2] by an
explicit example. The technical feature is the appearance of the singular part $\hat h^s$
of the dual optimizer $\hat h \in L^1(X \times Y, \pi)^{**}$ obtained in [BLS09, Theorem 4.2].
In Example 3.1 below we show that, in general, the dual optimizer $\hat h$ does indeed
contain a non-trivial singular part. In addition, this example allows one to observe in a
rather explicit way how this singular part "builds up" for an optimizing sequence
$(\varphi_n \oplus \psi_n)_{n=1}^\infty \in L^1(X \times Y, \pi)$ which converges to $\hat h$
with respect to the weak-star topology. The construction of this example, which is a variant
of an example due to L. Ambrosio and A. Pratelli [AP03], is rather lengthy and technical.
Some motivation for this construction will be given at the end of Section 2.

Section 4 pertains to a modified version of the duality relation in the Monge-Kantorovich
transport problem. Trivial counterexamples such as [BLS09, Example 1.1] show that in the
case of a measurable cost function $c : X \times Y \to [0, \infty]$ there may be a duality
gap. The main result (Theorem 1.2) of [BLS09] asserts that one may avoid this difficulty by
considering a suitable relaxed form of the primal problem; if one does so, duality holds
true in complete generality. In a different vein, one may leave the primal problem
unchanged and overcome the difficulties encountered in the above-mentioned simple example
by considering a slightly modified dual problem (cf. [BLS09, Remark 3.4]). In the last part
of the article we consider a certain twist of the construction given in Section 3, which
allows us to prove that this dual relaxation does not lead to a general duality result.



2 The finite case 

In this section we present the duality theory of optimal transport for the finite case: Let
$X = Y = \{1, \dots, N\}$ and let $\mu = \nu$ assign probability $N^{-1}$ to each of the
points $1, \dots, N$. Let $c = (c(i,j))_{i,j=1}^N$ be an $\mathbb{R}_+$-valued $N \times N$
matrix.

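The problem of optimal transport then becomes the following linear optimization problem

\[ \langle c, \pi \rangle := \sum_{i=1}^N \sum_{j=1}^N \pi(i,j)\, c(i,j) \to \min, \qquad \pi \in \mathbb{R}^{N^2}, \tag{1} \]

under the constraints

\[ \sum_{j=1}^N \pi(i,j) = N^{-1}, \qquad i = 1, \dots, N, \]

\[ \sum_{i=1}^N \pi(i,j) = N^{-1}, \qquad j = 1, \dots, N, \]

\[ \pi(i,j) \ge 0, \qquad i,j = 1, \dots, N. \]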

Of course, this is an easy and standard problem of linear optimization; yet we want
to treat it in some detail in order to develop intuition and concepts for the general case
considered in [BLS09] as well as in Section 3.

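For the two sets of equality constraints we introduce $2N$ Lagrange multipliers
$(\varphi(i))_{i=1}^N$ and $(\psi(j))_{j=1}^N$ taking values in $\mathbb{R}$, and for the
inequality constraints $\pi(i,j) \ge 0$ we introduce Lagrange multipliers
$(\varrho_{ij})_{i,j=1}^N$ taking values in $\mathbb{R}_+$. The Lagrangian functional
$L(\pi, \varphi, \psi, \varrho)$ is then given by

\[
\begin{aligned}
L(\pi, \varphi, \psi, \varrho) ={}& \sum_{i=1}^N \sum_{j=1}^N c(i,j)\,\pi(i,j) \\
& - \sum_{i=1}^N \varphi(i) \Big( \sum_{j=1}^N \pi(i,j) - N^{-1} \Big) \\
& - \sum_{j=1}^N \psi(j) \Big( \sum_{i=1}^N \pi(i,j) - N^{-1} \Big) \\
& - \sum_{i=1}^N \sum_{j=1}^N \varrho(i,j)\,\pi(i,j),
\end{aligned}
\]

where the $\varphi(i)$ and $\psi(j)$ range in $\mathbb{R}$, while the $\varrho(i,j)$ range
in $\mathbb{R}_+$.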

It is designed in such a way that

\[ C(\pi) := \sup_{\varphi, \psi, \varrho} L(\pi, \varphi, \psi, \varrho) = \langle c, \pi \rangle + \chi_{\Pi(\mu,\nu)}(\pi), \]

where $\Pi(\mu, \nu)$ denotes the admissible set of $\pi$'s, i.e., the probability measures
on $X \times Y$ with marginals $\mu$ and $\nu$, and $\chi_A(\cdot)$ denotes the indicator
function of a set $A$ in the sense of convex function theory, i.e., taking the value $0$ on
$A$ and the value $+\infty$ outside of $A$. In particular, we have

\[ P := \inf_{\pi \in \mathbb{R}^{N^2}} C(\pi) = \inf_{\pi \in \mathbb{R}^{N^2}} \sup_{\varphi, \psi, \varrho} L(\pi, \varphi, \psi, \varrho), \]

where $P$ is the optimal value of the primal optimization problem (1).

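To develop the duality theory of the primal problem (1) we pass from $\inf \sup L$ to
$\sup \inf L$. Denote by $D(\varphi, \psi, \varrho)$ the dual function

\[
\begin{aligned}
D(\varphi, \psi, \varrho) &= \inf_{\pi \in \mathbb{R}^{N^2}} L(\pi, \varphi, \psi, \varrho) \\
&= \inf_{\pi \in \mathbb{R}^{N^2}} \sum_{i=1}^N \sum_{j=1}^N \pi(i,j) \big[ c(i,j) - \varphi(i) - \psi(j) - \varrho(i,j) \big]
 + N^{-1} \Big[ \sum_{i=1}^N \varphi(i) + \sum_{j=1}^N \psi(j) \Big].
\end{aligned}
\tag{2}
\]

The infimum equals $-\infty$ unless the bracket multiplying $\pi(i,j)$ vanishes for all
$i,j$. Hence we obtain as the optimal value of the dual problem

\[ D := \sup_{\varphi, \psi, \varrho} D(\varphi, \psi, \varrho) = \sup_{\varphi, \psi} \Big\{ \big( E_\mu[\varphi] + E_\nu[\psi] \big) - \chi_\Psi(\varphi, \psi) \Big\}, \]

where $\Psi$ denotes the admissible set of pairs $(\varphi, \psi)$, i.e. those satisfying

\[ \varphi(i) + \psi(j) + \varrho(i,j) = c(i,j), \qquad 1 \le i,j \le N, \]

for some non-negative "slack variables" $\varrho(i,j)$.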

Let us show that there is no duality gap, i.e., the values of P and D coincide. Of course, in 
the present finite dimensional case, this equality as well as the fact that the inf sup (resp. sup 
inf) above is a min max (resp. a max min) easily follows from general compactness arguments. 
Yet we want to verify things directly using the idea of "complementary slackness" of the 
primal and the dual constraints (good references are, e.g. [PSU88, ET99, AE06]). 

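We apply "Fenchel's perturbation map" to explicitly show the equality $P = D$. Let

\[ T : \mathbb{R}^{N^2} \to \mathbb{R}^N \times \mathbb{R}^N \]

be the linear map defined as

\[ T\big( (\pi(i,j))_{1 \le i,j \le N} \big) = \Big( \Big( \sum_{j=1}^N \pi(i,j) \Big)_{i=1}^N, \Big( \sum_{i=1}^N \pi(i,j) \Big)_{j=1}^N \Big), \]

so that the problem (1) now can be phrased as

\[ \langle c, \pi \rangle = \sum_{i=1}^N \sum_{j=1}^N \pi(i,j)\, c(i,j) \to \min, \qquad \pi \in \mathbb{R}_+^{N^2}, \]

under the constraint

\[ T(\pi) = \big( (N^{-1}, \dots, N^{-1}), (N^{-1}, \dots, N^{-1}) \big). \]

The range of the linear map $T$ is the subspace $E \subseteq \mathbb{R}^N \times \mathbb{R}^N$,
of codimension 1, formed by the pairs $(f,g)$ such that $\sum_{i=1}^N f(i) = \sum_{j=1}^N g(j)$,
in other words $E_\mu[f] = E_\nu[g]$. We consider $T$ as a map from $\mathbb{R}^{N^2}$ to $E$
and denote by $E_+$ the positive orthant of $E$.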
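As a small illustration of the map $T$, the following sketch computes the two marginal vectors of an $N \times N$ array. The function name `perturbation_map` and the sample arrays are chosen here for illustration only and do not appear in the paper.

```python
from fractions import Fraction


def perturbation_map(pi):
    """Return T(pi) = (row sums, column sums) of a square array pi."""
    n = len(pi)
    f = [sum(pi[i][j] for j in range(n)) for i in range(n)]
    g = [sum(pi[i][j] for i in range(n)) for j in range(n)]
    return f, g


N = 3
# The product coupling has uniform marginals (N^{-1}, ..., N^{-1}).
product_coupling = [[Fraction(1, N * N)] * N for _ in range(N)]
f0, g0 = perturbation_map(product_coupling)

# For an arbitrary array the two marginals still have the same total mass,
# which is the single linear relation cutting out the subspace E.
arbitrary = [[2, 0, 1], [0, 5, 0], [3, 1, 4]]
f, g = perturbation_map(arbitrary)
```

The only linear relation between the two output vectors is that their entries have the same sum, which is why the range $E$ has codimension 1.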
Let $\Phi : E_+ \to [0, \infty]$ be the map

\[ \Phi(f, g) = \inf \big\{ \langle c, \pi \rangle : \pi \in \mathbb{R}_+^{N^2},\ T(\pi) = (f, g) \big\}. \]

We shall verify explicitly that $\Phi$ is an $\mathbb{R}_+$-valued, convex, lower
semi-continuous, positively homogeneous map on $E_+$.

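The finiteness and positivity of $\Phi$ follow from the fact that, for $(f,g) \in E_+$, the
set of $\pi \in \mathbb{R}_+^{N^2}$ with $T(\pi) = (f,g)$ is non-empty and from the
non-negativity of $c$. As regards the convexity of $\Phi$, let
$(f_1, g_1), (f_2, g_2) \in E_+$ and find $\pi_1, \pi_2 \in \mathbb{R}_+^{N^2}$ such that
$T(\pi_1) = (f_1, g_1)$, $T(\pi_2) = (f_2, g_2)$ and
$\langle c, \pi_1 \rangle < \Phi(f_1, g_1) + \varepsilon$ as well as
$\langle c, \pi_2 \rangle < \Phi(f_2, g_2) + \varepsilon$. Then

\[ \Phi\Big( \frac{(f_1, g_1) + (f_2, g_2)}{2} \Big) \le \Big\langle c, \frac{\pi_1 + \pi_2}{2} \Big\rangle < \frac{\Phi(f_1, g_1) + \Phi(f_2, g_2)}{2} + \varepsilon, \]

which proves the convexity of $\Phi$.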

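If $((f_n, g_n))_{n=1}^\infty \in E_+$ converges to $(f,g)$, find $(\pi_n)_{n=1}^\infty$ in
$\mathbb{R}_+^{N^2}$ such that $T(\pi_n) = (f_n, g_n)$ and
$\langle c, \pi_n \rangle < \Phi(f_n, g_n) + n^{-1}$. Note that $(\pi_n)_{n=1}^\infty$ is
bounded in $\mathbb{R}^{N^2}$, so that there is a subsequence $(\pi_{n_k})_{k=1}^\infty$
converging to some $\pi \in \mathbb{R}_+^{N^2}$. By the continuity of $T$ we have
$T(\pi) = (f,g)$. Hence $\Phi(f,g) \le \langle c, \pi \rangle = \lim_{k} \langle c, \pi_{n_k} \rangle$,
showing the lower semi-continuity of $\Phi$. Finally note that $\Phi$ is positively
homogeneous, i.e., $\Phi(\lambda f, \lambda g) = \lambda \Phi(f, g)$, for $\lambda \ge 0$.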

The point $(f_0, g_0)$ with $f_0 = g_0 = (N^{-1}, \dots, N^{-1})$ is in $E_+$ and $\Phi$ is
bounded in a neighbourhood $V$ of $(f_0, g_0)$. Indeed, fixing any $0 < a < N^{-1}$, the
following set $V$ does the job:

\[ V = \big\{ (f,g) \in E : |f(i) - N^{-1}| < a,\ |g(j) - N^{-1}| < a, \text{ for } 1 \le i,j \le N \big\}. \]

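The boundedness of the lower semi-continuous convex function $\Phi$ on $V$ implies that the
subdifferential of $\Phi$ at $(f_0, g_0)$ is non-empty. Considering $\Phi$ as a function on
$\mathbb{R}^{2N}$ (by defining it to equal $+\infty$ on $\mathbb{R}^{2N} \setminus E_+$) we
may find an element $(\hat\varphi, \hat\psi) \in \mathbb{R}^N \times \mathbb{R}^N$ in this
subdifferential. By the positive homogeneity of $\Phi$ we have

\[ \Phi(f,g) \ge \langle (\hat\varphi, \hat\psi), (f,g) \rangle = \langle \hat\varphi, f \rangle + \langle \hat\psi, g \rangle, \qquad \text{for } (f,g) \in \mathbb{R}^N \times \mathbb{R}^N, \]

and

\[ P = \Phi(f_0, g_0) = \langle \hat\varphi, f_0 \rangle + \langle \hat\psi, g_0 \rangle. \]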

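By the definition of $\Phi$ we therefore have, for each $\pi \in \mathbb{R}_+^{N^2}$,

\[
\begin{aligned}
\langle c, \pi \rangle &\ge \inf \big\{ \langle c, \tilde\pi \rangle : \tilde\pi \in \mathbb{R}_+^{N^2},\ T(\tilde\pi) = T(\pi) \big\} \\
&\ge \langle T(\pi), (\hat\varphi, \hat\psi) \rangle \\
&= \sum_{i=1}^N \sum_{j=1}^N \pi(i,j) \big[ \hat\varphi(i) + \hat\psi(j) \big],
\end{aligned}
\]

so that

\[ c(i,j) \ge \hat\varphi(i) + \hat\psi(j), \qquad \text{for } 1 \le i,j \le N. \tag{3} \]

By compactness, there is $\hat\pi \in \Pi(\mu, \nu)$, i.e., there is an element
$\hat\pi \in \mathbb{R}_+^{N^2}$ verifying $T(\hat\pi) = (f_0, g_0)$, such that

\[ \langle c, \hat\pi \rangle = \langle \hat\varphi, f_0 \rangle + \langle \hat\psi, g_0 \rangle. \tag{4} \]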
Summing up, we have shown that $\hat\pi$ and $(\hat\varphi, \hat\psi)$ are primal and dual
optimizers and that the value of the primal problem equals the value of the dual problem,
namely $\langle \hat\varphi + \hat\psi, \hat\pi \rangle$.
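The equality $P = D$ can be checked by hand on a small instance. The sketch below is not taken from the paper: it assumes the Birkhoff-von Neumann theorem (for uniform marginals the minimum in (1) is attained at $N^{-1}$ times a permutation matrix), so the primal value is found by enumerating permutations, and it uses a hand-picked cost matrix `c` together with hand-picked dual potentials `phi`, `psi`.

```python
from fractions import Fraction
from itertools import permutations


def primal_value(c):
    """Brute-force P for uniform marginals; only sensible for small N.

    Assumption: an optimal plan can be taken to be N^{-1} times a
    permutation matrix (Birkhoff-von Neumann).
    """
    n = len(c)
    best = min(permutations(range(n)),
               key=lambda s: sum(c[i][s[i]] for i in range(n)))
    return Fraction(sum(c[i][best[i]] for i in range(n)), n), best


def dual_value(phi, psi):
    """E_mu[phi] + E_nu[psi] for the uniform measures."""
    return Fraction(sum(phi) + sum(psi), len(phi))


def is_dual_admissible(c, phi, psi):
    """Check phi(i) + psi(j) <= c(i, j) for all i, j."""
    n = len(c)
    return all(phi[i] + psi[j] <= c[i][j] for i in range(n) for j in range(n))


c = [[4, 1, 3], [2, 0, 5], [3, 2, 2]]      # illustrative cost matrix
phi, psi = [1, 0, 0], [2, 0, 2]            # illustrative dual potentials
P, sigma = primal_value(c)
D = dual_value(phi, psi)
# Complementary slackness: equality on the support of the optimal plan.
tight = all(phi[i] + psi[sigma[i]] == c[i][sigma[i]] for i in range(len(c)))
```

Since `phi`, `psi` are admissible and their dual value matches the primal value, both are optimizers for this instance.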

To finish this elementary treatment of the finite case, let us consider the case when we
allow the cost function $c$ to take values in $[0, \infty]$ rather than in $[0, \infty[$. In
this case the primal problem simply loses some dimensions: for the $(i,j)$'s where
$c(i,j) = \infty$ we must have $\pi(i,j) = 0$, so that we consider

\[ \langle c, \pi \rangle := \sum_{i=1}^N \sum_{j=1}^N \pi(i,j)\, c(i,j) \to \min, \qquad \pi \in \mathbb{R}_+^{N^2}, \]

where we now optimize over $\pi \in \mathbb{R}_+^{N^2}$ with $\pi(i,j) = 0$ if
$c(i,j) = \infty$. For the problem to make sense we clearly must have that there is at least
one $\pi \in \Pi(\mu, \nu)$ with $\langle c, \pi \rangle < \infty$. If this non-triviality
condition is satisfied, the above arguments carry over without any non-trivial modification.

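We now analyze explicitly the well-known "complementary slackness conditions" and
interpret them in the present context. For a pair $\hat\pi$ and $(\hat\varphi, \hat\psi)$ of
primal and dual optimizers we have

\[ c(i,j) > \hat\varphi(i) + \hat\psi(j) \ \Rightarrow\ \hat\pi(i,j) = 0, \]

and

\[ \hat\pi(i,j) > 0 \ \Rightarrow\ c(i,j) = \hat\varphi(i) + \hat\psi(j). \]

Indeed, these relations follow from the admissibility condition
$c \ge \hat\varphi + \hat\psi$ and the duality relation
$\langle \hat\pi, c - (\hat\varphi + \hat\psi) \rangle = 0$.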

This motivates the following definitions in the theory of optimal transport (see, e.g.,
[RR96] for (a) and [ST08] for (b)).

Definition 2.1. Let $X = Y = \{1, \dots, N\}$ and $\mu = \nu$ the uniform distribution on
$X$ and $Y$ respectively, and let $c : X \times Y \to \mathbb{R}_+$ be given.

(a) A subset $\Gamma \subseteq X \times Y$ is called "cyclically c-monotone" if, for
$(i_1, j_1), \dots, (i_n, j_n) \in \Gamma$, we have

\[ \sum_{k=1}^n c(i_k, j_k) \le \sum_{k=1}^n c(i_k, j_{k+1}), \tag{5} \]

where $j_{n+1} = j_1$.

(b) A subset $\Gamma \subseteq X \times Y$ is called "strongly cyclically c-monotone" if
there are functions $\varphi, \psi$ such that $\varphi(i) + \psi(j) \le c(i,j)$, for all
$(i,j) \in X \times Y$, with equality holding true for $(i,j) \in \Gamma$.

In the present finite setting, the following facts are rather obvious (assertion (iii)
following from the above discussion):

(i) The support of each primal optimizer $\hat\pi$ is cyclically c-monotone.

(ii) Every $\pi \in \Pi(\mu, \nu)$ which is supported by a cyclically c-monotone set
$\Gamma$ is a primal optimizer.

(iii) A set $\Gamma \subseteq X \times Y$ is cyclically c-monotone iff it is strongly
cyclically c-monotone.
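Assertion (iii) can be made concrete when $\Gamma$ is the graph of a permutation $\sigma$. The sketch below is an illustration under assumptions not made in the paper: it swaps in a Bellman-Ford style relaxation on the graph with edge weights $w(i \to k) = c(i, \sigma(k)) - c(i, \sigma(i))$. Condition (5) fails exactly when this graph has a negative cycle; otherwise the resulting distances `d` give potentials $\psi(\sigma(i)) = d(i)$, $\varphi(i) = c(i, \sigma(i)) - d(i)$ witnessing strong cyclical c-monotonicity. The function name `potentials` and the cost matrix are hypothetical.

```python
def potentials(c, sigma):
    """Return (phi, psi) for Gamma = {(i, sigma[i])}, or None if Gamma
    is not cyclically c-monotone."""
    n = len(c)
    d = [0] * n  # distances from a virtual source joined to every node

    def relax():
        changed = False
        for i in range(n):
            for k in range(n):
                w = c[i][sigma[k]] - c[i][sigma[i]]
                if d[i] + w < d[k]:
                    d[k] = d[i] + w
                    changed = True
        return changed

    for _ in range(n):
        if not relax():
            break
    if relax():
        return None  # still improving: a negative cycle violates (5)

    phi = [c[i][sigma[i]] - d[i] for i in range(n)]
    psi = [0] * n
    for i in range(n):
        psi[sigma[i]] = d[i]
    return phi, psi


c = [[4, 1, 3], [2, 0, 5], [3, 2, 2]]      # illustrative cost matrix
monotone = potentials(c, (1, 0, 2))        # graph of the optimal permutation
not_monotone = potentials(c, (0, 1, 2))    # identity is not optimal here
```

At a fixed point of the relaxation we have $d(k) \le d(i) + w(i \to k)$ for all $i, k$, which is precisely $\varphi(i) + \psi(\sigma(k)) \le c(i, \sigma(k))$, with equality for $k = i$.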
In general, one may ask, for a given Monge-Kantorovich transport optimization problem,
defined on Polish spaces $X, Y$, equipped with Borel probability measures $\mu, \nu$, and a
Borel measurable cost function $c : X \times Y \to [0, \infty]$, the following natural
questions:

(P) Does there exist a primal optimizer to (1), i.e. a Borel measure
$\hat\pi \in \Pi(\mu, \nu)$ with marginals $\mu, \nu$, such that

\[ \int_{X \times Y} c \, d\hat\pi = \inf_{\pi \in \Pi(\mu, \nu)} \int_{X \times Y} c \, d\pi =: P \]

holds true?

(D) Do there exist dual optimizers to (2), i.e. Borel functions $(\hat\varphi, \hat\psi)$
in $\Psi(\mu, \nu)$ such that

\[ \int_X \hat\varphi \, d\mu + \int_Y \hat\psi \, d\nu = \sup_{(\varphi, \psi) \in \Psi(\mu, \nu)} \Big( \int_X \varphi \, d\mu + \int_Y \psi \, d\nu \Big) =: D, \tag{6} \]

where $\Psi(\mu, \nu)$ denotes the set of all pairs of $[-\infty, +\infty[$-valued
integrable Borel functions $(\varphi, \psi)$ on $X, Y$ such that
$\varphi(x) + \psi(y) \le c(x,y)$, for all $(x,y) \in X \times Y$?

(DG) Is there a duality gap, or do we have $P = D$, as it should - morally speaking -
hold true?



These are three natural questions which arise in every convex optimization problem. In
addition, one may ask the following two questions pertaining to the special features of the
Monge-Kantorovich transport problem.

(CC) Is every cyclically c-monotone transport plan $\pi \in \Pi(\mu, \nu)$ optimal, where we
call $\pi \in \Pi(\mu, \nu)$ cyclically c-monotone if there is a Borel subset
$\Gamma \subseteq X \times Y$ of full support $\pi(\Gamma) = 1$, verifying condition (5)
for any $(x_1, y_1), \dots, (x_n, y_n) \in \Gamma$?

(SCC) Is every strongly cyclically c-monotone transport plan $\pi \in \Pi(\mu, \nu)$
optimal, where we call $\pi \in \Pi(\mu, \nu)$ strongly cyclically c-monotone if there are
Borel functions $\varphi : X \to [-\infty, +\infty[$ and $\psi : Y \to [-\infty, +\infty[$,
satisfying $\varphi(x) + \psi(y) \le c(x,y)$, for all $(x,y) \in X \times Y$, and
$\pi\{\varphi + \psi = c\} = 1$?

Much effort has been made over the past decades to provide increasingly general answers
to the questions above. We mention the work of Rüschendorf [Rus96], who adapted the
notion of cyclical monotonicity from Rockafellar [Roc66]. Rockafellar's work pertains to
the case $c(x,y) = -\langle x, y \rangle$, for $x, y \in \mathbb{R}^n$, while Rüschendorf's
work pertains to the present setting of general cost functions $c$, thus arriving at the
notion of cyclical c-monotonicity. Intimately related is the notion of the c-conjugate
$\varphi^c$ of a function $\varphi$.

We also mention G. Kellerer's fundamental work on the duality theory; in [Kel84] he
established that $P = D$ provided that $c : X \times Y \to [0, \infty]$ is lower
semi-continuous, or merely Borel-measurable and uniformly bounded.

The seminal paper [GM96] proves (among many other results) that we have a positive
answer to question (CC) above in the following situation: every cyclically c-monotone
transport plan is optimal provided that the cost function $c$ is continuous and $X, Y$ are
compact subsets of $\mathbb{R}^n$. In [Vil03, Problem 2.25] it is asked whether this extends
to the case $X = Y = \mathbb{R}^n$ with the squared Euclidean distance as cost function.
This was answered independently in [Pra08] and [ST08]: the answer to (CC) is positive for
general Polish spaces $X$ and $Y$, provided that the cost function
$c : X \times Y \to [0, \infty]$ is continuous ([Pra08]) or lower semi-continuous and
finitely valued ([ST08]). Indeed, in the latter case, a transport plan is optimal if and
only if it is strongly c-monotone.



Let us briefly summarize the state of the art pertaining to the five questions above.

As regards the most basic issue, namely (DG) pertaining to the question whether duality
makes sense at all, this is analyzed in detail, building on a lot of previous literature, in
Section 2 of the accompanying paper [BLS09]: it is shown there that, for a properly relaxed
version of the primal problem, question (DG) has an affirmative answer in a perfectly
general setting, i.e. for arbitrary Borel-measurable cost functions
$c : X \times Y \to [0, \infty]$ defined on the product of two Polish spaces $X, Y$,
equipped with Borel probability measures $\mu, \nu$.

As regards question (P) we find the following situation: if the cost function
$c : X \times Y \to [0, \infty]$ is lower semi-continuous, the answer to question (P) is
always positive. Indeed, for an optimizing sequence $(\pi_n)_{n=1}^\infty$ in
$\Pi(\mu, \nu)$, one may apply Prokhorov's theorem to find a weak limit
$\hat\pi = \lim_{k \to \infty} \pi_{n_k}$. If $c$ is lower semi-continuous, we get

\[ \int_{X \times Y} c \, d\hat\pi \le \lim_{k \to \infty} \int_{X \times Y} c \, d\pi_{n_k} = P, \]

which yields the optimality of $\hat\pi$.

On the other hand, if $c$ fails to be lower semi-continuous, there is little reason why a
primal optimizer should exist (see, e.g., [Kel84, Example 2.20]).

As regards (D), the question of the existence of a dual optimizer is more delicate than
for the primal case (P): it was shown in [AP03, Theorem 3.2] that, for
$c : X \times Y \to \mathbb{R}_+$ satisfying a certain moment condition, one may assert the
existence of integrable optimizers $(\hat\varphi, \hat\psi)$. However, if one drops this
moment condition, there is little reason why, for an optimizing sequence
$(\varphi_n, \psi_n)_{n=1}^\infty$ in (D) above, the $L^1$-norms should remain bounded.
Hence there is little reason why one should be able to find integrable optimizers
$(\hat\varphi, \hat\psi)$, as shown by easy examples (e.g. [BS09, Examples 4.4, 4.5])
arising in rather regular situations.

Yet one would like to be able to pass to some kind of limit $(\hat\varphi, \hat\psi)$,
whether these functions are integrable or not. In the case when $\hat\varphi$ and/or
$\hat\psi$ fail to be integrable, special care then has to be taken to give a proper sense
to (6).

This situation was the motivation for the introduction of the notion of strong cyclical
c-monotonicity in [ST08]: this notion (see (SCC) above) characterizes the optimality of
a given $\pi \in \Pi(\mu, \nu)$ in terms of a "complementary slackness condition",
involving some $(\varphi, \psi) \in \Psi(\mu, \nu)$ playing the role of a dual optimizer
$(\hat\varphi, \hat\psi)$. The crucial feature is that we do not need any integrability of
the functions $\varphi$ and $\psi$ for this notion to make sense. It was shown in [BS09]
that, also in situations where there are no integrable optimizers
$(\hat\varphi, \hat\psi)$, one may find Borel measurable functions $(\varphi, \psi)$ taking
their roles in the setting of (SCC) above.

This theme was further developed in [BS09], where it was shown that, for
$\mu \otimes \nu$-a.s. finite, Borel measurable $c : X \times Y \to [0, \infty]$, one may
find Borel functions $\hat\varphi : X \to [-\infty, +\infty)$ and
$\hat\psi : Y \to [-\infty, \infty)$ which are dual optimizers if we interpret (6)
properly: instead of considering

\[ \int_X \hat\varphi \, d\mu + \int_Y \hat\psi \, d\nu, \tag{7} \]

which needs integrability of $\hat\varphi$ and $\hat\psi$ in order to make sense, we consider

\[ \int_{X \times Y} \big( \hat\varphi(x) + \hat\psi(y) \big) \, d\pi(x,y), \tag{8} \]

where the transport plan $\pi \in \Pi(\mu, \nu)$ is assumed to have finite transport cost
$\int_{X \times Y} c(x,y) \, d\pi(x,y) < \infty$. If (7) makes sense, then its value
coincides with the value of (8); the crucial feature is that (8) also makes sense in cases
when (7) does not make sense any more, as shown in [BS09, Lemma 1.1]. In particular, the
value of (8) does not depend on the choice of the transport plan $\pi \in \Pi(\mu, \nu)$,
provided $\pi$ has finite transport cost
$\int_{X \times Y} c(x,y) \, d\pi(x,y) < \infty$.

Summing up the preceding discussion on the existence (D) of a dual optimizer
$(\hat\varphi, \hat\psi)$: this question has a - properly interpreted - positive answer
provided that the cost function $c : X \times Y \to [0, \infty]$ is
$\mu \otimes \nu$-a.s. finite ([BS09, Theorem 2]).

But things become much more complicated if we pass to cost functions
$c : X \times Y \to [0, \infty]$ assuming the value $+\infty$ on possibly "large" subsets of
$X \times Y$.

In [BLS09, Example 4.1] we exhibit an example, which is a variant of an example due to
L. Ambrosio and A. Pratelli [AP03, Example 3.5], of a lower semi-continuous cost function
$c : [0,1) \times [0,1) \to [0, \infty]$, where $(X, \mu) = (Y, \nu)$ equals $[0,1)$
equipped with Lebesgue measure, for which there are no Borel measurable functions
$\varphi, \psi$ verifying $\varphi(x) + \psi(y) \le c(x,y)$ and maximizing (8) above.

In this example, the cost function $c$ equals the value $+\infty$ on "many" points of
$X \times Y = [0,1) \times [0,1)$. In fact, for each $x \in [0,1[$, there are precisely two
points $y_1, y_2 \in [0,1[$ such that $c(x, y_1) < \infty$ and $c(x, y_2) < \infty$, while
for all other $y \in [0,1[$ we have $c(x,y) = \infty$. In addition, there is an optimal
transport plan $\hat\pi \in \Pi(\mu, \nu)$ whose support equals the set
$\{(x,y) \in [0,1) \times [0,1) : c(x,y) < \infty\}$.

In this example one may observe the following phenomenon: while there do not exist
Borel measurable functions $\varphi : [0,1) \to [-\infty, +\infty)$ and
$\psi : [0,1) \to [-\infty, \infty)$ such that $\varphi(x) + \psi(y) = c(x,y)$ on
$\{c(x,y) < \infty\}$, there does exist a Borel function
$\hat h : [0,1) \times [0,1) \to [-\infty, \infty)$ such that $\hat h(x,y) = c(x,y)$ on
$\{c(x,y) < \infty\}$ and such that
$\hat h(x,y) = \lim_{n \to \infty} (\varphi_n(x) + \psi_n(y))$, where
$(\varphi_n, \psi_n)_{n=1}^\infty$ are properly chosen, bounded Borel functions. The point
is that the limit holds true (only) in the norm of $L^1([0,1[ \times [0,1[, \hat\pi)$ as
well as $\hat\pi$-a.s.

In other words, in this example we are able to identify some kind of dual optimizer
$\hat h \in L^1([0,1) \times [0,1), \hat\pi)$ which, however, is not of the form
$\hat h(x,y) = \varphi(x) + \psi(y)$ for some Borel functions $(\varphi, \psi)$, but only a
$\hat\pi$-a.s. limit of such functions $(\varphi_n(x) + \psi_n(y))_{n=1}^\infty$.

In [BLS09, Theorem 4.2] we established a result which shows that much of the positive
aspect of this phenomenon, i.e. the existence of an optimal $\hat h \in L^1(\hat\pi)$
encountered in the context of the above example, can be carried over to a general setting.
For the convenience of the reader we restate this theorem and the notations required to
formulate it.

Fix a finite transport plan
$\pi_0 \in \Pi(\mu, \nu, c) := \{\pi \in \Pi(\mu, \nu) : \int_{X \times Y} c \, d\pi < \infty\}$.
We denote by $\Pi^{(\pi_0)}(\mu, \nu)$ the set of elements $\pi \in \Pi(\mu, \nu)$ such that
$\pi \ll \pi_0$ and $\| \frac{d\pi}{d\pi_0} \|_{L^\infty(\pi_0)} < \infty$.

Note that $\Pi^{(\pi_0)}(\mu, \nu) = \Pi(\mu, \nu) \cap L^\infty(\pi_0) \subseteq \Pi(\mu, \nu, c)$.
We shall replace the usual Kantorovich optimization problem over the set
$\Pi(\mu, \nu, c)$ by the optimization over the smaller set $\Pi^{(\pi_0)}(\mu, \nu)$. Its
value is

\[ P^{(\pi_0)} = \inf \Big\{ \langle c, \pi \rangle = \int c \, d\pi : \pi \in \Pi^{(\pi_0)}(\mu, \nu) \Big\}. \tag{9} \]

As regards the dual problem, we define, for $\varepsilon > 0$,

\[ D^{(\pi_0, \varepsilon)} = \sup \Big\{ \int \varphi \, d\mu + \int \psi \, d\nu : \varphi \in L^1(\mu),\ \psi \in L^1(\nu),\ \int_{X \times Y} \big( \varphi(x) + \psi(y) - c(x,y) \big)_+ \, d\pi_0 \le \varepsilon \Big\} \]

and

\[ D^{(\pi_0)} = \lim_{\varepsilon \to 0} D^{(\pi_0, \varepsilon)}. \tag{10} \]
Define the "summing" map $S$ by

\[ S : L^1(X, \mu) \times L^1(Y, \nu) \to L^1(X \times Y, \pi_0), \qquad (\varphi, \psi) \mapsto \varphi \oplus \psi, \]

where $\varphi \oplus \psi$ denotes the function $\varphi(x) + \psi(y)$ on $X \times Y$.
Denote by $L^1_S(X \times Y, \pi_0)$ the $\|\cdot\|_1$-closed linear subspace of
$L^1(X \times Y, \pi_0)$ spanned by $S(L^1(X, \mu) \times L^1(Y, \nu))$. Clearly
$L^1_S(X \times Y, \pi_0)$ is a Banach space under the norm $\|\cdot\|_1$ induced by
$L^1(X \times Y, \pi_0)$.

We shall also need the bi-dual $L^1_S(X \times Y, \pi_0)^{**}$, which may be identified with
a subspace of $L^1(X \times Y, \pi_0)^{**}$. In particular, an element
$h \in L^1_S(X \times Y, \pi_0)^{**}$ can be decomposed into $h = h^r + h^s$, where
$h^r \in L^1(X \times Y, \pi_0)$ is the regular part of the finitely additive measure $h$
and $h^s$ its purely singular part.

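Theorem 2.2. Let $c : X \times Y \to [0, \infty]$ be Borel measurable, and let
$\pi_0 \in \Pi(\mu, \nu, c)$ be a finite transport plan. We have

\[ P^{(\pi_0)} = D^{(\pi_0)}. \]

There is an element $\hat h \in L^1_S(X \times Y, \pi_0)^{**}$ such that $\hat h \le c$ and

\[ D^{(\pi_0)} = \langle \hat h, \pi_0 \rangle. \]

If $\pi \in \Pi^{(\pi_0)}(\mu, \nu)$ (identifying $\pi$ with $\frac{d\pi}{d\pi_0}$) satisfies
$\int c \, d\pi \le P^{(\pi_0)} + \alpha$ for some $\alpha \ge 0$, then

\[ |\langle \hat h^s, \pi \rangle| \le \alpha. \tag{12} \]

In particular, if $\pi$ is an optimizer of (9), then $\hat h^s$ vanishes on the set
$\{\frac{d\pi}{d\pi_0} > 0\}$.

In addition, we may find a sequence of elements
$(\varphi_n, \psi_n) \in L^1(\mu) \times L^1(\nu)$ such that

\[ \varphi_n \oplus \psi_n \to \hat h^r, \quad \pi_0\text{-a.s.}, \qquad \| (\varphi_n \oplus \psi_n - \hat h^r)_+ \|_{L^1(\pi_0)} \to 0 \]

and

\[ \lim_{\delta \to 0} \ \sup_{A \subseteq X \times Y,\ \pi_0(A) < \delta} \ \lim_{n \to \infty} -\langle (\varphi_n \oplus \psi_n) \mathbf{1}_A, \pi_0 \rangle = \| \hat h^s \|_{L^1(\pi_0)^{**}}. \tag{13} \]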

The assertion of the theorem extends the phenomenon of [BLS09, Example 4.1] to a
general setting. There is, however, one additional complication, as compared to the
situation of this specific example: in the above theorem we can only assert that we find the
optimizer $\hat h$ in $L^1(\pi_0)^{**}$ rather than in $L^1(\pi_0)$. The question arises
whether this complication is indeed unavoidable. The purpose of the subsequent section is to
construct an example showing that the phenomenon of a non-vanishing singular part
$\hat h^s$ of $\hat h = \hat h^r + \hat h^s$ may indeed arise in the above setting. In
addition, the example gives a good illustration of the subtleties of the situation described
by the theorem above.



3 The singular part of the dual optimizer 

In this section we refine the construction of Examples 4.1 and 4.3 in [BLS09] (which in turn
are variants of an example due to L. Ambrosio and A. Pratelli [AP03, Example 3.2]). We
assume that the reader is familiar with these examples and freely use the notation from this
paper.

In particular, for an irrational $\alpha \in [0,1)$ we write, for $k \in \mathbb{Z}$,$^1$

\[ \varrho_k(x) = 1 + \#\{0 \le i < k : x \oplus i\alpha \in [0, \tfrac12)\} - \#\{0 \le i < k : x \oplus i\alpha \in [\tfrac12, 1)\}, \tag{14} \]

where, for $k < 0$, we mean by $0 \le i < k$ the set $\{k+1, k+2, \dots, 0\}$ and $\oplus$
denotes addition modulo 1. We also recall that the function
$h : [0,1) \times [0,1) \to \mathbb{Z} \cup \{\infty\}$ is defined in [BLS09, Example 4.3] as

\[ h(x,y) = \begin{cases} \varrho_k(x), & k \in \mathbb{Z} \text{ and } y = x \oplus k\alpha, \\ \infty, & \text{otherwise.} \end{cases} \tag{15} \]

$^1$ In [BLS09] the constructions are carried out for $\mathbb{N}$ instead of $\mathbb{Z}$,
but for our purposes the latter choice turns out to be better suited.

In [BLS09, Example 4.3] we considered the $[0, \infty]$-valued cost function
$c(x,y) := h_+(x,y)$, the positive part of the function $h$. We now construct an example
restricting $h_+(x,y)$ to a certain subset of $[0,1) \times [0,1)$.

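Example 3.1. Consider $X = Y = [0,1)$ and denote by $\mu$ resp. $\nu$ the Lebesgue measure
on $X$, resp. $Y$. There is an irrational $\alpha \in [0,1)$ and a map
$\tau : [0,1) \to \mathbb{Z}$ such that, for

\[ \Gamma_0 = \{(x, x) : x \in [0,1)\}, \quad \Gamma_1 = \{(x, x \oplus \alpha) : x \in [0,1)\}, \quad \Gamma_\tau = \{(x, x \oplus \tau(x)\alpha) : x \in [0,1)\} \]

and letting

\[ c(x,y) = \begin{cases} h_+(x,y), & \text{for } (x,y) \in \Gamma_0 \cup \Gamma_1 \cup \Gamma_\tau, \\ \infty, & \text{otherwise,} \end{cases} \]

the following properties are satisfied.

(i) The maps

\[ T_\alpha^0(x) = x, \qquad T_\alpha^1(x) = x \oplus \alpha, \qquad T_\alpha^{(\tau)}(x) = x \oplus (\tau(x)\alpha) \]

are measure preserving bijections from $[0,1)$ to $[0,1)$ with respect to the Lebesgue
measure ($\mu$ in the present setting). Denote by $\pi_0, \pi_1, \pi_\tau$ the corresponding
transport plans in $\Pi(\mu, \nu)$, i.e.

\[ \pi_0 = (\mathrm{id}, \mathrm{id})_\# \mu, \qquad \pi_1 = (\mathrm{id}, T_\alpha)_\# \mu, \qquad \pi_\tau = (\mathrm{id}, T_\alpha^{(\tau)})_\# \mu, \]

and let $\pi = (\pi_0 + \pi_1 + \pi_\tau)/3$.

(ii) The transport plans $\pi_0$ and $\pi_1$ are optimal while $\pi_\tau$ is not. In fact,
we have

\[ \langle c, \pi_0 \rangle = \langle c, \pi_1 \rangle = 1 \quad \text{while} \quad \langle c, \pi_\tau \rangle \ge \langle h, \pi_\tau \rangle > 1. \tag{16} \]

(iii) There is a sequence $(\varphi_n, \psi_n)_{n=1}^\infty$ of bounded Borel functions such
that

\[ \text{(a)} \quad \varphi_n(x) + \psi_n(y) \le c(x,y), \quad \text{for } x \in X,\ y \in Y, \tag{17} \]

\[ \text{(b)} \quad \lim_{n \to \infty} \Big( \int_X \varphi_n(x) \, d\mu(x) + \int_Y \psi_n(y) \, d\nu(y) \Big) = 1, \tag{18} \]

\[ \text{(c)} \quad \lim_{n \to \infty} \big( \varphi_n(x) + \psi_n(y) \big) = h(x,y), \quad \pi\text{-almost surely.} \tag{19} \]

(iv) Using the notation of [BLS09, Theorem 4.2] we find that for each dual optimizer
$\hat h \in L^1(\pi)^{**}$, which decomposes as $\hat h = \hat h^r + \hat h^s$ into its
regular part $\hat h^r \in L^1(\pi)$ and its purely singular part
$\hat h^s \in L^1(\pi)^{**}$, we have

\[ \hat h^r = h, \quad \pi\text{-a.s.}, \tag{20} \]

and the singular part $\hat h^s$ satisfies
$\| \hat h^s \|_{L^1(\pi)^{**}} = \langle h, \pi_\tau \rangle - 1 > 0$. In particular, the
singular part $\hat h^s$ of $\hat h$ does not vanish. The finitely additive measure
$\hat h^s$ is supported by $\Gamma_\tau$, i.e.
$\langle |\hat h^s|, \mathbf{1}_{\Gamma_0} + \mathbf{1}_{\Gamma_1} \rangle = 0$.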
We shall use a special irrational $\alpha \in [0,1)$, namely

\[ \alpha = \sum_{j=1}^\infty \frac{1}{M_j}, \]

where $M_j = m_1 m_2 \cdots m_j = M_{j-1} m_j$, and $(m_j)_{j=1}^\infty$ is a sequence of
prime numbers $m_j \ge 5$ tending sufficiently fast to infinity, to be specified below. We
let

\[ \alpha_n = \sum_{j=1}^n \frac{1}{M_j}, \]

which, of course, is a rational number.

We will need the following lemma. We thank Leonhard Summerer for showing us the
proof of Lemma 3.2.

Lemma 3.2. It is possible to choose a sequence $m_1, m_2, \dots$ of primes growing
arbitrarily fast to infinity, such that with $M_1 = m_1$, $M_2 = m_1 m_2$, $\dots$,
$M_n = m_1 \cdots m_n$, $\dots$ we have, for each $n \in \mathbb{N}$,

\[ \sum_{j=1}^n \frac{1}{M_j} = \frac{P_n}{M_n}, \]

with $P_n$ and $M_n$ relatively prime.

Proof. We have

\[ \frac{P_n}{M_n} = \frac{m_2 \cdots m_n + m_3 \cdots m_n + \dots + m_n + 1}{m_1 \cdots m_n}, \]

thus $P_n$ and $M_n$ are relatively prime if and only if

\[ m_1 \nmid m_2 \cdots m_n + m_3 \cdots m_n + \dots + m_n + 1, \tag{21} \]
\[ m_2 \nmid m_3 \cdots m_n + \dots + m_n + 1, \tag{22} \]
\[ \vdots \tag{23} \]
\[ m_{n-1} \nmid m_n + 1. \tag{24} \]

We claim that these conditions are, e.g., satisfied provided that we choose
$m_1, m_2, \dots$ such that $m_1 \ge 3$ and

\[ m_{i+1} \equiv +1 \ (m_i), \tag{25} \]
\[ m_{i+j} \equiv -1 \ (m_i) \quad \text{if } j \ge 2, \tag{26} \]

for all $i \ge 1$. Indeed (25), (26) imply that for $k \in \{1, \dots, n-1\}$ we have modulo
$(m_k)$

\[ m_{k+1} \cdots m_n + m_{k+2} \cdots m_n + m_{k+3} \cdots m_n + \dots + m_n + 1 \equiv (\pm 1) + (\pm 1) + (\mp 1) + \dots + (-1) + (+1), \]

where on the right-hand side the $(n - k + 1)$ summands start to alternate after the second
term. Thus, for even $n - k$, this amounts to

\[ m_{k+1} \cdots m_n + m_{k+2} \cdots m_n + m_{k+3} \cdots m_n + \dots + m_n + 1 \equiv (-1) + (-1) + (+1) + \dots + (-1) + (+1) = -1, \]

while we obtain, for odd $n - k$,

\[ m_{k+1} \cdots m_n + m_{k+2} \cdots m_n + m_{k+3} \cdots m_n + \dots + m_n + 1 \equiv (+1) + (+1) + (-1) + \dots + (-1) + (+1) = +2. \]

Hence (21)-(24) are satisfied, as the $m_n$ were chosen such that $m_n > 2$.

We use induction to construct a sequence of primes satisfying (25) and (26). Assume that
$m_1, \dots, m_i$ have been defined. By the Chinese remainder theorem the system of
congruences

\[ x \equiv -1 \ (m_1), \quad \dots, \quad x \equiv -1 \ (m_{i-1}), \quad x \equiv +1 \ (m_i) \]

has a solution $x \in \{1, \dots, m_1 \cdots m_i\}$, which is necessarily coprime to
$m_1 \cdots m_i$. By Dirichlet's theorem, the arithmetic progression
$x + k m_1 \cdots m_i$, $k \in \mathbb{N}$, contains infinitely many primes, so we may pick
one which is as large as we please. The induction continues. □
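The conditions (25) and (26) are easy to test numerically. The following sketch is an illustration only and deviates from the lemma in one respect: instead of letting the primes grow arbitrarily fast, it always picks the smallest admissible prime, starting from the assumed value $m_1 = 5$. The helper names `is_prime`, `extend` and `prime_sequence` are hypothetical.

```python
from fractions import Fraction
from math import prod


def is_prime(p):
    """Trial division; adequate for the small values used here."""
    if p < 2:
        return False
    d = 2
    while d * d <= p:
        if p % d == 0:
            return False
        d += 1
    return True


def extend(ms):
    """Smallest prime p > ms[-1] with p = +1 mod ms[-1] (condition (25))
    and p = -1 mod every earlier m (condition (26))."""
    p = ms[-1] + 1
    while not (p % ms[-1] == 1
               and all(p % m == m - 1 for m in ms[:-1])
               and is_prime(p)):
        p += 1
    return p


def prime_sequence(n, m1=5):
    ms = [m1]
    while len(ms) < n:
        ms.append(extend(ms))
    return ms


ms = prime_sequence(4)
# alpha_n = sum_{j <= n} 1/M_j as an exact fraction in lowest terms.
partial_sums = [sum(Fraction(1, prod(ms[:j])) for j in range(1, n + 1))
                for n in range(1, len(ms) + 1)]
```

`Fraction` reduces to lowest terms automatically, so the lemma's claim that $P_n$ and $M_n$ are relatively prime amounts to `partial_sums[n - 1].denominator == prod(ms[:n])`.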

For $\beta \in [0,1)$, denote by $T_\beta : [0,1) \to [0,1)$, $T_\beta(x) := x \oplus \beta$,
the addition of $\beta$ modulo 1. With this notation we have $T_{\alpha_n}^{M_n} = \mathrm{id}$
and, by Lemma 3.2, it is possible to choose $m_1, \dots, m_n$ in such a way that $M_n$ is the
smallest such number in $\mathbb{N}$. Our aim is to construct a function
$\tau : [0,1) \to \mathbb{Z}$ such that the map

\[ T_\alpha^{(\tau)} : [0,1) \to [0,1), \qquad x \mapsto T_\alpha^{(\tau)}(x) = T_\alpha^{\tau(x)}(x) \]

defines, up to a $\mu$-null set, a measure preserving bijection on $[0,1)$, and such that the
corresponding transport plan $\pi_\tau \in \Pi(\mu, \nu)$, given by
$\pi_\tau = (\mathrm{id}, T_\alpha^{(\tau)})_\# \mu$, has the properties listed above with
respect to the cost function $c(x,y)$ which is the restriction of the function $h_+(x,y)$ to
$\Gamma_0 \cup \Gamma_1 \cup \Gamma_\tau$. We shall do so by an inductive procedure,
defining bounded $\mathbb{Z}$-valued functions $\tau_n$ on $[0,1)$ such that the maps
$T_{\alpha_n}^{(\tau_n)}$ are measure preserving bijections on $[0,1)$. The map
$T_\alpha^{(\tau)}$ then will be the limit of these $T_{\alpha_n}^{(\tau_n)}$.

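Step n=1: Fix a prime $M_1 = m_1 \ge 5$, so that $\alpha_1 = \frac{1}{M_1}$. Define

\[ I_{k_1} := \Big[ \frac{k_1 - 1}{M_1}, \frac{k_1}{M_1} \Big), \qquad k_1 = 1, \dots, M_1, \]

so that $(I_{k_1})_{k_1=1}^{M_1}$ forms a partition of $[0,1)$ and $T_{\alpha_1}$ maps
$I_{k_1}$ to $I_{k_1+1}$, with the convention $I_{M_1+1} = I_1$. We also introduce the
notations

\[ L^1 := \Big[ 0, \frac12 - \frac{1}{2M_1} \Big) \quad \text{and} \quad R^1 := \Big[ \frac12 + \frac{1}{2M_1}, 1 \Big) \]

for the segments left and right of the middle interval

\[ I^1_{\mathrm{middle}} := I_{(M_1+1)/2} = \Big[ \frac12 - \frac{1}{2M_1}, \frac12 + \frac{1}{2M_1} \Big). \]

We define the functions $\varphi^1, \psi^1$ on $[0,1)$ such that
$\varphi^1(x) + \psi^1(x) = 1$ and

\[ \varphi^1(x) + \psi^1(T_{\alpha_1}(x)) = \begin{cases} 0, & x \in L^1, \\ 1, & x \in I^1_{\mathrm{middle}}, \\ 2, & x \in R^1, \end{cases} \]

which leads to the relation

\[ \varphi^1(T_{\alpha_1}(x)) = \varphi^1(x) + \begin{cases} 1, & x \in L^1, \\ 0, & x \in I^1_{\mathrm{middle}}, \\ -1, & x \in R^1. \end{cases} \]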
Making the choice $\varphi^1 = 0$ on $I_1$ this leads to

\[ \varphi^1(x) = \begin{cases} k_1 - 1, & x \in I_{k_1},\ k_1 \in \{1, \dots, (M_1+1)/2\}, \\ M_1 + 1 - k_1, & x \in I_{k_1},\ k_1 \in \{(M_1+3)/2, \dots, M_1\}, \end{cases} \]

\[ \psi^1(x) = 1 - \varphi^1(x). \]

The function $\varphi^1$ starts at 0, increases until the middle interval, stays constant
when stepping to the interval right of the middle, and then decreases, reaching 1 on the
final interval $I_{M_1}$. The idea is to define the map $\tau_1 : [0,1) \to \mathbb{Z}$ in
such a way that the map

\[ T_{\alpha_1}^{(\tau_1)} : [0,1) \to [0,1), \qquad x \mapsto T_{\alpha_1}^{\tau_1(x)}(x) \]

is a measure preserving bijection enjoying the following property: the map

\[ x \mapsto \varphi^1(x) + \psi^1\big( T_{\alpha_1}^{(\tau_1)}(x) \big) \]

equals the value two on a large set while it has concentrated a negative mass which is close
to $-1$ on a small set.

This can be done, e.g., by shifting the first interval $I_1$ to the interval
$I_{(M_1-1)/2}$, which is left of the middle one, while we shift the intervals
$I_2, \dots, I_{(M_1-1)/2}$ by one interval to the left. On the right hand side of $[0,1)$
we proceed symmetrically, while the middle interval simply is not moved.

Fig. 1. Representations of $\varphi^1$ and $\tau_1$. The step function is $\varphi^1$ and
the arrows indicate the action of $T_{\alpha_1}^{(\tau_1)}$. This figure corresponds to the
value $M_1 = 11$.

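More precisely, we set

\[ \tau_1(x) = \begin{cases} \frac{M_1 - 3}{2}, & x \in I_1, \\ -1, & x \in I_{k_1},\ k_1 \in \{2, \dots, (M_1 - 1)/2\}, \\ 0, & x \in I_{(M_1+1)/2}, \\ 1, & x \in I_{k_1},\ k_1 \in \{(M_1 + 3)/2, \dots, M_1 - 1\}, \\ -\frac{M_1 - 3}{2}, & x \in I_{M_1}. \end{cases} \tag{28} \]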
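Then $T_{\alpha_1}^{(\tau_1)}$ induces a permutation of the intervals
$(I_{k_1})_{k_1=1}^{M_1}$ and a short calculation shows that

\[ \varphi^1(x) + \psi^1\big( T_{\alpha_1}^{(\tau_1)}(x) \big) = \begin{cases} -\frac{M_1 - 5}{2}, & x \in I_1 \cup I_{M_1}, \\ 2, & x \in I_{k_1},\ k_1 \in \{2, \dots, (M_1 - 1)/2\} \cup \{(M_1 + 3)/2, \dots, M_1 - 1\}, \\ 1, & x \in I_{(M_1+1)/2}. \end{cases} \tag{29} \]

The next figure is a representation of this "quasi-cost" at level $n = 1$, with the same
value $M_1 = 11$ as in Figure 1.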



Fig. 2. Representation of $\varphi^1 + \psi^1 \circ T_{\alpha_1}^{(\tau_1)}$.



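Assessment of Step n = 1. Let us summarize what we have achieved in the first induction step.
For later use we formulate things only in terms of $\varphi^1(\cdot)$ rather than $\psi^1(\cdot) = 1 - \varphi^1(\cdot)$.
For the set $J_1^g = \{2, \ldots, \frac{M_1-1}{2}\} \cup \{\frac{M_1+3}{2}, \ldots, M_1-1\}$ of "good"$^2$ indices we have

\[
\varphi^1(x) - \varphi^1\bigl(T_{\alpha_1}^{(\tau_1)}(x)\bigr) = 1, \qquad x \in I_{k_1},\ k_1 \in J_1^g,
\tag{30}
\]

while for the set $J_1^s = \{1, M_1\}$ of "singular indices" we have

\[
\varphi^1(x) - \varphi^1\bigl(T_{\alpha_1}^{(\tau_1)}(x)\bigr) = -\frac{M_1-3}{2}, \qquad x \in I_{k_1},\ k_1 \in J_1^s,
\tag{31}
\]

so that

\[
\sum_{k_1 \in J_1^s} \int_{I_{k_1}} \bigl[\varphi^1(x) - \varphi^1\bigl(T_{\alpha_1}^{(\tau_1)}(x)\bigr)\bigr]\,dx
= -\frac{M_1-3}{2} \cdot \frac{2}{M_1} = -1 + \frac{3}{M_1}.
\]

For the middle interval $I^1_{\mathrm{middle}} = I_{(M_1+1)/2}$ we have $\varphi^1(x) - \varphi^1(T_{\alpha_1}^{(\tau_1)}(x)) = 0$.
We also note for later use that, for $x \in [0,1)$, the orbit $(T_{\alpha_1}^i(x))_{i=0}^{\tau_1(x)-1}$ never visits $I^1_{\mathrm{middle}}$. Here
we mean that $i$ runs through $\{\tau_1(x), \tau_1(x)+1, \ldots, -1\}$ when $\tau_1(x) < 0$ and runs through
the empty set when $\tau_1(x) = 0$.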
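The bookkeeping of the first induction step can be replayed on the $M_1$ intervals directly. The following is a minimal sketch, not part of the original construction: the function name `step_one` and the choice $M_1 = 11$ are assumptions made for illustration, and $\varphi^1$ is taken to increase by one per interval on the left of the middle and to decrease by one per interval on the right, as described above.

```python
from fractions import Fraction

def step_one(M1):
    """Return (phi, tau, image), indexed by the interval number k = 1..M1."""
    assert M1 % 2 == 1 and M1 >= 5
    mid = (M1 + 1) // 2
    # phi^1 is constant on each interval: +1 when leaving an interval left of
    # the middle, 0 when leaving the middle one, -1 when leaving one on the right.
    phi = {1: 0}
    for k in range(1, M1):
        phi[k + 1] = phi[k] + (1 if k < mid else 0 if k == mid else -1)
    # tau_1 as in (28).
    tau = {}
    for k in range(1, M1 + 1):
        if k == 1:
            tau[k] = (M1 - 3) // 2
        elif k < mid:
            tau[k] = -1
        elif k == mid:
            tau[k] = 0
        elif k < M1:
            tau[k] = 1
        else:
            tau[k] = -((M1 - 3) // 2)
    # The shift by 1/M1 moves I_k to I_{k+1} (cyclically), so tau steps give:
    image = {k: (k - 1 + tau[k]) % M1 + 1 for k in tau}
    return phi, tau, image

M1 = 11  # illustrative value, the one used in Figures 1 and 2
phi, tau, image = step_one(M1)
diff = {k: phi[k] - phi[image[k]] for k in phi}          # left side of (30), (31)
singular_mass = Fraction(diff[1] + diff[M1], M1)         # each interval has length 1/M1
```

With these values, `diff` equals 1 on the good indices, 0 on the middle index and $-(M_1-3)/2$ on the two singular indices, so that `singular_mass` is $-1 + 3/M_1$.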

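Step n=2: We now pass from $\alpha_1 = \frac{1}{M_1}$ to $\alpha_2 = \frac{1}{M_1} + \frac{1}{M_2}$, where $M_2 = M_1 m_2 = m_1 m_2$ and
where $m_2$, to be specified below, satisfies the relations of Lemma 3.2 and is large compared
to $M_1$. For $1 \le k_1 \le M_1$ and $1 \le k_2 \le m_2$ we denote by $I_{k_1,k_2}$ the interval

\[
I_{k_1,k_2} = \Bigl[\frac{k_1-1}{M_1} + \frac{k_2-1}{M_2},\ \frac{k_1-1}{M_1} + \frac{k_2}{M_2}\Bigr).
\]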
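The indexing of the intervals $I_{k_1,k_2}$ can be checked with exact rational arithmetic. The sketch below is only an illustration: the helper names `interval` and `grid_index` and the values $M_1 = 5$, $m_2 = 11$ are assumptions made for the example.

```python
from fractions import Fraction

def interval(k1, k2, M1, m2):
    """Endpoints (left, right) of I_{k1,k2} = [left, right) as exact fractions."""
    M2 = M1 * m2
    left = Fraction(k1 - 1, M1) + Fraction(k2 - 1, M2)
    return left, left + Fraction(1, M2)

def grid_index(k1, k2, m2):
    """Position of I_{k1,k2} among the M2 cells of length 1/M2, counted from 0."""
    return (k1 - 1) * m2 + (k2 - 1)

M1, m2 = 5, 11                      # illustrative values only
M2 = M1 * m2
cells = sorted(interval(k1, k2, M1, m2)
               for k1 in range(1, M1 + 1) for k2 in range(1, m2 + 1))
middle = interval((M1 + 1) // 2, (m2 + 1) // 2, M1, m2)
```

The $M_2$ cells tile $[0,1)$ from left to right, and the cell with indices $((M_1+1)/2, (m_2+1)/2)$ is the one centred at $1/2$.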



2 We use the term "good" rather than "regular" as the abbreviation r is already taken by the word "right".

Similarly as above we will also use the notations $L^2 = [0, \frac12 - \frac{1}{2M_2})$, $R^2 = [\frac12 + \frac{1}{2M_2}, 1)$, and

\[
I^2_{\mathrm{middle}} = I_{(M_1+1)/2,\,(m_2+1)/2} = \Bigl[\frac12 - \frac{1}{2M_2},\ \frac12 + \frac{1}{2M_2}\Bigr).
\]

We now define functions $\varphi^2, \psi^2$ such that $\varphi^2(x) + \psi^2(x) = 1$ and

\[
\varphi^2(x) + \psi^2(T_{\alpha_2}(x)) =
\begin{cases}
0, & x \in L^2, \\
1, & x \in I^2_{\mathrm{middle}}, \\
2, & x \in R^2.
\end{cases}
\]

This is achieved if we set, e.g., $\varphi^2 = 0$ on $I_{1,1}$ and

\[
\varphi^2(T_{\alpha_2}(x)) = \varphi^2(x) +
\begin{cases}
1, & x \in L^2, \\
0, & x \in I^2_{\mathrm{middle}}, \\
-1, & x \in R^2,
\end{cases}
\qquad \psi^2(x) = 1 - \varphi^2(x).
\tag{32}
\]

Yet another way to express this is to say that for $j \in \{0, \ldots, M_2 - 1\}$ we have

\[
\varphi^2(T_{\alpha_2}^j(x)) = \#\{i \in \{0, \ldots, j-1\} : T_{\alpha_2}^i(x) \in L^2\}
- \#\{i \in \{0, \ldots, j-1\} : T_{\alpha_2}^i(x) \in R^2\}, \qquad x \in I_{1,1},
\tag{33}
\]

in analogy to (14).
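Since $\varphi^2$ is constant on each of the $M_2$ cells of length $1/M_2$, the counting formula (33) can be evaluated on a finite grid. The following sketch makes several assumptions for illustration: the name `phi_table`, the values $M_1 = 5$, $m_2 = 11$, and the representation of a cell by its position $p \in \{0, \ldots, M_2-1\}$, on which $T_{\alpha_2}$ acts as $p \mapsto p + (m_2+1) \bmod M_2$ because $\alpha_2 = (m_2+1)/M_2$.

```python
from math import gcd

def phi_table(M1, m2):
    """phi^2 on the grid of M2 cells, computed along the orbit of the first cell."""
    M2 = M1 * m2
    step = m2 + 1                       # alpha_2 = 1/M1 + 1/M2 = (m2 + 1)/M2
    assert gcd(step, M2) == 1           # the orbit then visits every cell once
    mid = (M2 - 1) // 2                 # position of I^2_middle
    phi = [None] * M2
    p, value = 0, 0                     # start on I_{1,1} with phi^2 = 0
    for _ in range(M2):
        phi[p] = value
        # +1 when the current cell lies in L^2, -1 in R^2, 0 on the middle cell
        value += 1 if p < mid else -1 if p > mid else 0
        p = (p + step) % M2
    assert p == 0 and value == 0        # the count closes up after M2 steps
    return phi

phi = phi_table(5, 11)                  # illustrative values only
```

On this grid the quasi-cost $\varphi^2(x) + \psi^2(T_{\alpha_2}(x)) = 1 + \varphi^2(x) - \varphi^2(T_{\alpha_2}(x))$ takes the values 0, 1, 2 on $L^2$, $I^2_{\mathrm{middle}}$, $R^2$ respectively.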

While the function $\varphi^1(x)$ in the first induction step was increasing from $I_1$ to $I_{(M_1+1)/2}$
and then decreasing from $I_{(M_1+3)/2}$ to $I_{M_1}$, the function $\varphi^2(x)$ displays a similar feature on
each of the intervals $I_{k_1}$: roughly speaking, i.e. up to terms controlled by $M_1$, it increases
on the left half of each such interval and then decreases again on the right half. The next
lemma makes this fact precise. We keep in mind, of course, that $m_2$ will be much bigger
than $M_1$.

Lemma 3.3 (Oscillations of $\varphi^2$). The function $\varphi^2$ defined in (32) has the following properties.

(i) $|\varphi^2(x) - \varphi^2(x \oplus \frac{1}{M_2})| \le 4M_1^2$, for $x \in [0,1)$.
(ii) For each $1 \le k_1', k_1'' \le M_1$ we have

\[
\varphi^2\big|_{I_{k_1',\,(m_2+1)/2}} - \varphi^2\big|_{I_{k_1'',\,1}} \ge \frac{m_2}{2M_1} - 10M_1^3.
\]

Proof. Let us begin with the proof of (i).

• Proof of (i). While $T_{\alpha_1}^{M_1} = \mathrm{id}$ holds true, we have that $T_{\alpha_2}^{M_1}$ is only close to the identity
map. In fact, as $T_{\alpha_2}(x) = x \oplus \frac{m_2+1}{M_2}$, we have

\[
T_{\alpha_2}^{M_1}(x) = x \oplus \frac{M_1}{M_2}.
\tag{34}
\]

Somewhat less obvious is the fact that $T_{\alpha_2}^{m_2-2}$ also is close to the identity map. In fact

\[
T_{\alpha_2}^{m_2-2}(x) = x \ominus \frac{2}{M_2}.
\tag{35}
\]

Indeed, by (25) applied to $i = 1$, there is $c \in \mathbb{N}$ such that $m_2 = cM_1 + 1$. Hence

\[
T_{\alpha_2}^{m_2-2}(x) = x \oplus (m_2-2)\frac{m_2+1}{M_2}
= x \oplus (cM_1 - 1)\frac{m_2+1}{M_2}
= x \oplus \frac{cM_2 - m_2 + (m_2 - 2)}{M_2}
= x \ominus \frac{2}{M_2}.
\]

Here is one more remarkable feature of the map $T_{\alpha_2}^{m_2-2}$.

Claim: For $x \in [0,1)$ the orbit $(T_{\alpha_2}^i(x))_{i=1}^{m_2-2}$ visits the intervals $L^2 = [0, \frac12 - \frac{1}{2M_2})$ and
$R^2 = [\frac12 + \frac{1}{2M_2}, 1)$ approximately equally often. More precisely, the difference of the visits of
these two intervals is bounded in absolute value by $4M_1$.

Indeed, by Lemma 3.2, the orbit $(T_{\alpha_2}^i(x))_{i=1}^{M_2}$ visits each of the intervals $I_{k_1,k_2}$ exactly one
time so that it visits $L^2$ and $R^2$ equally often, namely $\frac{M_2-1}{2}$ times. The $M_1$ many disjoint
subsets $\bigl(T_{\alpha_2}^{(j-1)(m_2-2)}(T_{\alpha_2}^i(x))_{i=1}^{m_2-2}\bigr)_{j=1}^{M_1}$ of this orbit are obtained by shifting them successively
by $2/M_2$ to the left, see (35). As the difference
$(T_{\alpha_2}^i(x))_{i=1}^{M_2} \setminus \bigcup_{j=1}^{M_1} T_{\alpha_2}^{(j-1)(m_2-2)}(T_{\alpha_2}^i(x))_{i=1}^{m_2-2}$ consists
only of $2M_1$ many points we have that the difference of the visits of
$\bigcup_{j=1}^{M_1} T_{\alpha_2}^{(j-1)(m_2-2)}(T_{\alpha_2}^i(x))_{i=1}^{m_2-2}$
to $L^2$ and $R^2$ is bounded by $4M_1$. This implies that the difference of the visits of $(T_{\alpha_2}^i(x))_{i=1}^{m_2-2}$
to $L^2$ and $R^2$ can be estimated by $4M_1$ too: indeed, if this orbit visits $L^2$ $4M_1 + k$ many times
more often than $R^2$ (or vice versa) for some $k > 0$, then $(T_{\alpha_2}^{m_2-2}(T_{\alpha_2}^i(x)))_{i=1}^{m_2-2}$ visits $L^2$ at
least $4M_1 + k - 4$ many times more often than $R^2$ etc., and finally $(T_{\alpha_2}^{(M_1-1)(m_2-2)}(T_{\alpha_2}^i(x)))_{i=1}^{m_2-2}$
visits $L^2$ at least $k$ many times more often than $R^2$, which yields a contradiction. Hence we
have proved the claim.

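To prove assertion (i) note that by (34) and (35)

\[
T_{\alpha_2}^{M_1} \circ T_{\alpha_2}^{\frac{M_1-1}{2}(m_2-2)}(x) = x \oplus \frac{1}{M_2}.
\tag{36}
\]

We deduce from the claim that the difference of the visits of the orbit
$(T_{\alpha_2}^i(x))_{i=0}^{\frac{M_1-1}{2}(m_2-2)+M_1-1}$
to $L^2$ and $R^2$ is bounded in absolute value by $\frac{M_1-1}{2}(4M_1) + M_1$, which proves (i).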
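The shift identities used in this proof are plain modular arithmetic and can be replayed with integers. The sketch below rests on stated assumptions: `shift_numerator` is a hypothetical helper, the pairs $(M_1, c)$ are arbitrary illustrative choices with $M_1$ odd and $m_2 = cM_1 + 1$, and (36) is read as $M_1$ steps composed with $\frac{M_1-1}{2}$ blocks of $m_2 - 2$ steps.

```python
def shift_numerator(steps, M1, m2):
    """T_{alpha_2}^steps is x -> x (+) r/M2; return r in {0, ..., M2 - 1}."""
    M2 = M1 * m2
    return (steps * (m2 + 1)) % M2      # alpha_2 = (m2 + 1)/M2

for M1, c in [(5, 2), (5, 6), (7, 4), (11, 2)]:
    m2 = c * M1 + 1                     # relation (25) for i = 1
    M2 = M1 * m2
    # (34): M1 steps shift by M1/M2 to the right.
    assert shift_numerator(M1, M1, m2) == M1
    # (35): m2 - 2 steps shift by 2/M2 to the left.
    assert shift_numerator(m2 - 2, M1, m2) == M2 - 2
    # (36): combining both gives a shift by exactly 1/M2.
    j = M1 + (M1 - 1) // 2 * (m2 - 2)
    assert shift_numerator(j, M1, m2) == 1
```

The number of steps `j` needed to move one cell to the right grows like $M_1 m_2 / 2$, which is why the oscillation bound in (i) has to be obtained from the claim rather than from a crude count.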

• Proof of (ii). As regards (ii) suppose first fcj = fc" =: ki. Note that, for x £ /[ eft := 

+ 2MT - we have that the orbit PU^o 1 visits L 2 one time more 

often than R 2 , namely Ml 2 +1 versus Ml 2 ~ 1 times. If we start with x 6 then, for 

1 < j < 535^ - 1 we have that T^{x) e l\f . Hence, for the orbit (T^L 2 / ^" 1 ^ 1 , 
the difference of the visits to the interval L 2 and i? 2 equals [tm~\ ~ 1; the integer part of 



1112 



2jlY — 1 . Combining this estimate with the estimate (i) as well as the fact that the distance 

between x 8 (L^grJ ~ l) Hr and x ® V rKit is bounded b y 2 ^ 2 ~\ we obtain, for x € J fcl ,i 
and y € I m2 +i , that 



fci, 2 



rt)-^(*)>^(T« 1 (x))-^ 2 (x)- ^( y )-^(r Q2 ^ J (x)) 
>(LfMrJ-i)-(2Mi-i)(4M 2 ) 



-2Mi 

22. 

2 Mi 



> "2 _ O il .f 3 



Passing to the general case 1 < k[ , k'{ < Mi observe that T a \ ~ 1 maps / , m 2 +i to 

fcn 2 

/ m 2 +i , „ , , • Using again (i) we obtain estimate (ii) . □ 

We are now ready to do the inductive construction for $n = 2$. For $m_2$ satisfying
the conditions of Lemma 3.1 and to be specified below, we shall define $\tau_2 : [0,1) \to
\{-M_2+1, \ldots, 0, \ldots, M_2-1\}$, where $M_2 = m_1 m_2$, such that the map

\[
T_{\alpha_2}^{(\tau_2)} : [0,1) \to [0,1), \qquad x \mapsto T_{\alpha_2}^{(\tau_2)}(x) := T_{\alpha_2}^{\tau_2(x)}(x)
\]

has the following properties.

(i) The measure-preserving bijection $T_{\alpha_2}^{(\tau_2)} : [0,1) \to [0,1)$ maps each interval $I_{k_1}$ onto
$T_{\alpha_1}^{(\tau_1)}(I_{k_1})$. It induces a permutation of the intervals $I_{k_1,k_2}$, where $1 \le k_1 \le M_1$, $1 \le k_2 \le m_2$.

(ii) When $\tau_2(x) > 0$, we have

\[
T_{\alpha_2}^i(x) \notin I^2_{\mathrm{middle}}, \qquad i = 0, \ldots, \tau_2(x),
\tag{37}
\]

and, when $\tau_2(x) < 0$, we have

\[
T_{\alpha_2}^i(x) \notin I^2_{\mathrm{middle}}, \qquad i = \tau_2(x), \ldots, 0.
\tag{38}
\]

(iii) On the "good" intervals $I_{k_1}$, where $k_1 \in J_1^g = \{2, \ldots, \frac{M_1-1}{2}\} \cup \{\frac{M_1+3}{2}, \ldots, M_1-1\}$, for
which we have, by (30),

\[
\varphi^1(x) - \varphi^1\bigl(T_{\alpha_1}^{(\tau_1)}(x)\bigr) = 1,
\]

the function $\tau_2$ will satisfy the estimates

An^/T!}]^*], (39) 

and 

/4/Vf 2 

|l-/ (a; ) + ^ (T ( ? ) (a;)) | da;< _ j_. {40) 

KifcJJ - fc l 2 

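(iv) On the "singular" intervals $I_{k_1}$, where $k_1 \in J_1^s = \{1, M_1\}$, for which we have, by (31),

\[
\varphi^1(x) - \varphi^1\bigl(T_{\alpha_1}^{(\tau_1)}(x)\bigr) = -\frac{M_1-3}{2},
\]

we split $\{1, \ldots, m_2\}$ into a set $J^{k_1,g}$ of "good" indices, and a set $J^{k_1,s}$ of "singular"
indices, such that

\[
\varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr) = 0, \qquad \text{for } x \in I_{k_1,k_2},\ k_2 \in J^{k_1,g},
\]

while

\[
\varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr) \le -\frac{m_2}{2M_1} + 20M_1^4, \qquad \text{for } x \in I_{k_1,k_2},\ k_2 \in J^{k_1,s},
\]

where $J^{k_1,s}$ consists of $M_1(M_1-3)$ many elements of $\{1, \ldots, m_2\}$.

Hence we have a total "singular mass" of

\[
\sum_{k_1 \in J_1^s} \sum_{k_2 \in J^{k_1,s}} \int_{I_{k_1,k_2}} \bigl[\varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr)\bigr]\,dx
\le -1 + \frac{3}{M_1} + \frac{c(M_1)}{m_2},
\tag{41}
\]

where $c(M_1)$ is a constant depending only on $M_1$.

(v) On the middle interval $I^1_{\mathrm{middle}} = I_{(M_1+1)/2}$ we simply let $\tau_2 = \tau_1 = 0$.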

Let us illustrate graphically an interesting property of this construction, namely the
shape of the quasi-cost function $\varphi^2 + \psi^2 \circ T_{\alpha_2}^{(\tau_2)}$.

Fig. 3. Shape of the quasi-cost $\varphi^2 + \psi^2 \circ T_{\alpha_2}^{(\tau_2)}$.

The strips in this graphic representation symbolize the oscillations of the function $\varphi^2 + \psi^2 \circ T_{\alpha_2}^{(\tau_2)}$.
On the "singular" set, it achieves values of order $-M_2/M_1^2$.

It will sometimes be more convenient to specify to which interval $I_{l_1,l_2}$ the interval $I_{k_1,k_2}$
is mapped under $T_{\alpha_2}^{(\tau_2)}$, instead of spelling out the value of $\tau_2$ on the interval $I_{k_1,k_2}$. Note
that by Lemma 3.2, for each map associating to $(k_1,k_2)$ a pair $(l_1,l_2)$, there corresponds
precisely one value $\tau_2|_{I_{k_1,k_2}} \in \{-M_2+1, \ldots, 0, \ldots, M_2-1\}$ such that (37) (resp.
(38)) is satisfied and $T_{\alpha_2}^{(\tau_2)}(I_{k_1,k_2}) = I_{l_1,l_2}$.

Let us start with a "good" interval $I_{k_1}$, with $k_1 \in J_1^g$ as in (iii) above, say $k_1 \in
\{2, \ldots, \frac{M_1-1}{2}\}$, for which we have $\tau_1(x) = -1$. Then the intervals $I_{k_1,2}, \ldots, I_{k_1,m_2}$ are
mapped under $T_{\alpha_2}^{\tau_1(x)} = T_{\alpha_2}^{-1}$ onto the intervals $I_{k_1-1,1}, \ldots, I_{k_1-1,m_2-1}$. Defining
$\tau_2(x) = \tau_1(x)$ on these intervals we get for $x \in I_{k_1,k_2}$, where $2 \le k_1 \le \frac{M_1-1}{2}$, $2 \le k_2 \le m_2$,

\[
1 = \varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr) = \varphi^1(x) - \varphi^1\bigl(T_{\alpha_1}^{(\tau_1)}(x)\bigr).
\tag{42}
\]

We still have to define the value of $\tau_2(x)$, for $x \in I_{k_1,1}$. The map $T_{\alpha_2}^{(\tau_2)}$ has to map $I_{k_1,1}$
to the remaining gap $I_{k_1-1,m_2}$, which happens to be its left neighbour. We do not explicitly
calculate the unique number $\tau_2|_{I_{k_1,1}} \in \{-M_2+1, \ldots, M_2-1\}$, satisfying (37) (resp. (38)),
which does the job, but only use the conclusion of Lemma 3.3 to find that, for $x \in I_{k_1,1}$ such
that $T_{\alpha_2}^{(\tau_2)}(x) \in I_{k_1-1,m_2}$,

\[
\bigl|1 - [\varphi^2(x) - \varphi^2(T_{\alpha_2}^{(\tau_2)}(x))]\bigr| \le 4M_1^2 + 1.
\tag{43}
\]

This takes care of the "good" intervals $I_{k_1}$, where $k_1 \in \{2, \ldots, \frac{M_1-1}{2}\}$.

Fig. 4-a. $k_1 \in J_1^g$ on the left side.$^3$

For the "good" intervals $I_{k_1}$, where $k_1 \in \{\frac{M_1+3}{2}, \ldots, M_1-1\}$, we have $\tau_1(x) = 1$ so
that $T_{\alpha_2}^{\tau_1(x)}$ maps the intervals $I_{k_1,1}, \ldots, I_{k_1,m_2-1}$ to $I_{k_1+1,2}, \ldots, I_{k_1+1,m_2}$. Again we define
$\tau_2(x) = \tau_1(x) = 1$, for $x$ in these intervals so that we obtain the identity (42), for $\frac{M_1+3}{2} \le
k_1 \le M_1-1$ and $1 \le k_2 \le m_2-1$. Finally, $T_{\alpha_2}^{(\tau_2)}$ has to map $I_{k_1,m_2}$ to the interval $I_{k_1+1,1}$
so that again we derive an estimate as in (43).

Fig. 4-b. $k_1 \in J_1^g$ on the right side.

This finishes item (iii), i.e. the definition of $\tau_2$ on the "good" intervals $I_{k_1}$. Noting that on
this set we have $\tau_1 \ne \tau_2$ only on $M_1 - 3$ many intervals of length $\frac{1}{M_2}$ we obtain the estimate
(40).

To show (iv) let us first consider the "singular" interval $I_1$, on which we have $\tau_1(x) = \frac{M_1-3}{2}$
and $\varphi^1(T_{\alpha_1}^{(\tau_1)}(x)) = \varphi^1(T_{\alpha_1}^{(\tau_1)}(x)) - \varphi^1(x) = \frac{M_1-3}{2}$. For the subintervals $I_{1,k_2}$ of $I_1$,
define the set of good indices as $J^{1,g} = J^{1,g,l} \cup J^{1,g,r}$ where

\[
J^{1,g,l} = \Bigl\{\tfrac{(M_1-3)(M_1-1)}{2} + 1, \ldots, \tfrac{m_2-1}{2}\Bigr\}, \qquad
J^{1,g,r} = \Bigl\{\tfrac{m_2+1}{2}, \ldots, m_2 - \tfrac{(M_1-3)(M_1+1)}{2}\Bigr\}.
\]

Let us start by considering $k_2 \in J^{1,g,r}$. We define

\[
\tau_2(x) = \tau_1(x) + \frac{M_1-3}{2} M_1 = \frac{(M_1-3)(M_1+1)}{2}, \qquad x \in I_{1,k_2},\ k_2 \in J^{1,g,r}.
\]

First note that $T_{\alpha_2}^{(\tau_2)}$ then maps the intervals $I_{1,k_2}$, for $k_2 \in J^{1,g,r}$, to the intervals

\[
I_{\frac{M_1-1}{2},\ \frac{m_2+1}{2} + \frac{(M_1-3)(M_1+1)}{2}},\ \ldots,\ I_{\frac{M_1-1}{2},\ m_2}.
\]

Observe that, for $x$ as above, the orbit $(T_{\alpha_2}^i(x))_{i=0}^{\tau_2(x)-1}$ always lies in the right halves of the
respective intervals $I_{k_1}$.

Let us count how often the orbit $(T_{\alpha_2}^i(x))_{i=0}^{\tau_2(x)-1}$ visits $L^2$ and $R^2$ respectively, for $x \in
I_{1,k_2}$ and $k_2 \in J^{1,g,r}$. The first $\tau_1(x) = \frac{M_1-3}{2}$ elements of this orbit are all in $L^2$, which
yields, similarly as in the induction step $n = 1$,

\[
\varphi^2\bigl(T_{\alpha_2}^{\tau_1(x)}(x)\bigr) - \varphi^2(x) = \varphi^1\bigl(T_{\alpha_1}^{(\tau_1)}(x)\bigr) - \varphi^1(x) = \frac{M_1-3}{2}.
\]


3 Figure 3 is built with the small value $m_2 = 7$ for the sake of clarity of the drawing. But this value is
not feasible since with the lowest $m_1 = 5$, (25) implies that $m_2$ is at least equal to 11; other requirements of
the construction imply that it has to be even larger.

But the next $M_1$ many elements of this orbit, namely

\[
\bigl(T_{\alpha_2}^i(x)\bigr)_{i=\tau_1(x)}^{\tau_1(x)+M_1-1},
\]

visit $R^2$ one time more often than $L^2$ as the unique element of this orbit which lies in $I^1_{\mathrm{middle}}$
belongs to the right half of $I^1_{\mathrm{middle}}$.

This phenomenon repeats on the orbit $(T_{\alpha_2}^i(x))_{i=0}^{\tau_1(x)+\frac{M_1-3}{2}M_1-1}$ for $\frac{M_1-3}{2}$ many times so
that

\[
\begin{aligned}
\varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr)
&= \bigl[\varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{\tau_1(x)}(x)\bigr)\bigr] + \bigl[\varphi^2\bigl(T_{\alpha_2}^{\tau_1(x)}(x)\bigr) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr)\bigr] \\
&= -\frac{M_1-3}{2} + \frac{M_1-3}{2} \\
&= 0, \qquad \text{for } x \in I_{1,k_2} \text{ and } k_2 \in J^{1,g,r}.
\end{aligned}
\tag{44}
\]

This takes care of $I_{1,k_2}$ with $k_2 \in J^{1,g,r}$.

For $x \in I_{1,k_2}$ with $k_2 \in J^{1,g,l}$, the left half of the "good" intervals, we define symmetrically

\[
\tau_2(x) = \tau_1(x) - \frac{M_1-3}{2} M_1 = -\frac{(M_1-3)(M_1-1)}{2}.
\]

A similar analysis as above shows that $T_{\alpha_2}^{(\tau_2)}$ maps the intervals $I_{1,k_2}$, where $k_2 \in J^{1,g,l}$, to
the intervals $I_{\frac{M_1-1}{2},\,1}, \ldots, I_{\frac{M_1-1}{2},\ \frac{m_2-1}{2} - \frac{(M_1-3)(M_1-1)}{2}}$. Hence by a symmetric reasoning we
again obtain equality (44) for $x$ in the intervals $I_{1,k_2}$, and for $k_2 \in J^{1,g,l}$ too.

Now we have to deal with the "singular" subintervals $I_{1,k_2}$, where $k_2 \in J^{1,s}$, and the
singular indices are given by

\[
J^{1,s} = \{1, \ldots, m_2\} \setminus J^{1,g}
= \Bigl\{1, \ldots, \tfrac{(M_1-3)(M_1-1)}{2}\Bigr\} \cup \Bigl\{m_2 - \tfrac{(M_1-3)(M_1+1)}{2} + 1, \ldots, m_2\Bigr\},
\]

which consists of $M_1(M_1-3)$ many indices.
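The index sets on the singular interval $I_1$ and the identity (44) can be checked on the grid of $M_2$ cells. This is a sketch under explicit assumptions: `phi_table` and `check_singular_interval` are hypothetical helper names, cells are represented by positions $p = (k_1-1)m_2 + (k_2-1)$, and the values $M_1 = 5$, $m_2 = 31$ are chosen only to keep the example small (they are not claimed to satisfy all requirements of the construction, such as $m_2$ being large compared to $M_1$).

```python
from math import gcd

def phi_table(M1, m2):
    """phi^2 on the grid of M2 cells, via the counting formula (33)."""
    M2, step = M1 * m2, m2 + 1
    assert gcd(step, M2) == 1
    mid = (M2 - 1) // 2
    phi, p, value = [None] * M2, 0, 0
    for _ in range(M2):
        phi[p] = value
        value += 1 if p < mid else -1 if p > mid else 0
        p = (p + step) % M2
    return phi

def check_singular_interval(M1, m2):
    """Return (singular indices of I_1, positions of the remaining gaps)."""
    M2, step = M1 * m2, m2 + 1
    phi = phi_table(M1, m2)
    a = (M1 - 3) * (M1 - 1) // 2            # |tau_2| on the left good indices
    b = (M1 - 3) * (M1 + 1) // 2            # tau_2 on the right good indices
    good_left = range(a + 1, (m2 - 1) // 2 + 1)
    good_right = range((m2 + 1) // 2, m2 - b + 1)
    singular = [k for k in range(1, m2 + 1)
                if k not in good_left and k not in good_right]
    images = set()
    for indices, tau2 in ((good_left, -a), (good_right, b)):
        for k2 in indices:
            p = k2 - 1                      # position of I_{1,k2}
            q = (p + tau2 * step) % M2      # position of its image
            assert phi[p] == phi[q]         # identity (44)
            images.add(q)
    target = (M1 - 1) // 2                  # index k1 of the interval receiving I_1
    gaps = [q for q in range((target - 1) * m2, target * m2) if q not in images]
    return singular, gaps

singular, gaps = check_singular_interval(5, 31)   # illustrative values only
```

For these values there are $M_1(M_1-3) = 10$ singular indices at the two ends of $I_1$, and equally many unfilled cells in the middle of $I_{(M_1-1)/2}$.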

The map $T_{\alpha_2}^{(\tau_2)}$ has to map these intervals $I_{1,k_2}$, where $k_2 \in J^{1,s}$, to the "remaining gaps"
$I_{\frac{M_1-1}{2},\,l_2}$ in the interval $I_{\frac{M_1-1}{2}}$, where
$l_2 \in \bigl\{\frac{m_2-1}{2} - \frac{(M_1-3)(M_1-1)}{2} + 1, \ldots, \frac{m_2+1}{2} + \frac{(M_1-3)(M_1+1)}{2} - 1\bigr\}$.
Note that the corresponding intervals $I_{\frac{M_1-1}{2},\,l_2}$ are, roughly speaking, in the middle
of the interval $I_{\frac{M_1-1}{2}}$, while the intervals $I_{1,k_2}$, with $k_2 \in J^{1,s}$, are at the boundary of $I_1$.

To define $\tau_2$ on $I_{1,k_2}$, for $k_2 \in J^{1,s}$, choose any function $\tau_2$ taking values in $\{-M_2+1, \ldots, M_2-1\}$,
satisfying (37) (resp. (38)) as above, which induces a bijection between
the intervals $(I_{1,k_2})_{k_2 \in J^{1,s}}$ and the intervals $I_{\frac{M_1-1}{2},\,l_2}$ considered above.

Fig. 5. $\tau_2$ for the "singular" indices on the left side.

In this drawing, the interval $I^{1,g,l}$ is the union of the intervals $I_{1,k_2}$ with $k_2 \in J^{1,g,l}$. A
similar convention holds for $I^{1,g,r}$ and $I^{1,s}$ (which is not an interval anymore).



For each such $\tau_2$ we obtain, for $x \in I_{1,k_2}$, $k_2 \in J^{1,s}$, from Lemma 3.3

\[
\varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr)
\le -\frac{m_2}{2M_1} + 10M_1^3 + 2\,\frac{(M_1-3)(M_1-1)}{2}\,4M_1^2
\le -\frac{m_2}{2M_1} + 20M_1^4.
\tag{45}
\]

Indeed, the leading term $-\frac{m_2}{2M_1}$ and the first error term $10M_1^3$ in the first line above come
from Lemma 3.3-(ii) when comparing the value of $\varphi^2$ on the interval $I_{1,1}$
to that on $I_{\frac{M_1-1}{2},\,\frac{m_2+1}{2}}$. For the difference of the value of $\varphi^2$ on $I_{1,k_2}$ and $I_{\frac{M_1-1}{2},\,l_2}$, for
arbitrary $k_2 \in J^{1,s}$ and $l_2$ in the range of the remaining gaps described above, we apply for
both cases at most $\frac{(M_1-3)(M_1-1)}{2}$ times estimate (i) of Lemma 3.3, which gives (45).

In particular, for $m_2 > 40M_1^5$, which of course we shall assume, we have that

\[
\varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr) < 0, \qquad \text{for } x \in I_{1,k_2},\ k_2 \in J^{1,s}.
\]

There are $M_1(M_1-3) = M_1^2 - 3M_1$ many intervals $I_{1,k_2}$ with $k_2 \in J^{1,s}$, each of length $1/M_2$.
Hence we may estimate the "singular mass" on the interval $I_1$ by

\[
\sum_{k_2 \in J^{1,s}} \int_{I_{1,k_2}} \bigl[\varphi^2(x) - \varphi^2\bigl(T_{\alpha_2}^{(\tau_2)}(x)\bigr)\bigr]\,dx
\le \Bigl(-\frac{m_2}{2M_1} + 20M_1^4\Bigr)(M_1^2 - 3M_1)\frac{1}{M_2}
= -\frac12 + \frac{3}{2M_1} + \frac{c(M_1)}{2m_2},
\tag{46}
\]

where $c(M_1)$ is a constant depending on $M_1$ only.$^4$

We still have another "singular" interval at the present induction step $n = 2$, namely $I_{M_1}$.
The analysis for this case is symmetric to the analysis of $I_1$ and, after properly defining
$\tau_2$ on this interval $I_{M_1}$, we arrive at the same estimate (46). In total, we thus obtain (41)
by doubling the right hand side of (46), showing that the "singular mass" essentially equals
$-1$.
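The arithmetic behind (46) and its doubling can be verified with exact fractions. In this sketch the function names and the explicit choice $c(M_1) = 40M_1^4(M_1-3)$ are assumptions made for illustration; the text only asserts that some constant depending on $M_1$ alone works.

```python
from fractions import Fraction

def singular_mass_bound(M1, m2):
    """Right hand side of (46) before simplification."""
    M2 = M1 * m2
    per_interval = Fraction(-m2, 2 * M1) + 20 * M1**4   # bound (45)
    count = M1 * (M1 - 3)                               # number of singular indices
    return per_interval * count * Fraction(1, M2)       # each interval has length 1/M2

def closed_form(M1, m2):
    """-1/2 + 3/(2 M1) + c(M1)/(2 m2) with one admissible constant c(M1)."""
    c = 40 * M1**4 * (M1 - 3)
    return Fraction(-1, 2) + Fraction(3, 2 * M1) + Fraction(c, 2 * m2)

def total_mass_bound(M1, m2):
    """Doubling (46) gives the right hand side of (41)."""
    return 2 * singular_mass_bound(M1, m2)
```

For fixed $M_1$ the doubled bound tends to $-1 + 3/M_1$ as $m_2$ grows, which is the sense in which the singular mass essentially equals $-1$.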

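Finally define the sets $J_2^g$ (resp. $J_2^s$) of "good" (resp. "singular") indices at level 2 as

\[
\begin{aligned}
J_2^g &= \{(k_1,k_2) : (k_1 \in J_1^g \text{ and } 1 \le k_2 \le m_2), \text{ or } (k_1 \in J_1^s \text{ and } k_2 \in J^{k_1,g})\}, \\
J_2^s &= \{(k_1,k_2) : k_1 \in J_1^s \text{ and } k_2 \in J^{k_1,s}\}.
\end{aligned}
\]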

This finishes the inductive step for n = 2. 

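General inductive step. Suppose that the prime numbers $m_1, \ldots, m_{n-1}$ have been defined.
We use the notation $\alpha_{n-1} = \frac{1}{M_1} + \cdots + \frac{1}{M_{n-1}}$, where $M_{n-1} = m_1 \cdot m_2 \cdots m_{n-1}$.
For a prime $m_n$ satisfying the condition of Lemma 3.2, and to be specified below, let
$M_n = m_1 \cdots m_n$ and

\[
L^n = \Bigl[0, \frac12 - \frac{1}{2M_n}\Bigr), \qquad
R^n = \Bigl[\frac12 + \frac{1}{2M_n}, 1\Bigr), \qquad
I^n_{\mathrm{middle}} = \Bigl[\frac12 - \frac{1}{2M_n},\ \frac12 + \frac{1}{2M_n}\Bigr).
\]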



4 We shall find it convenient in the sequel to write $c(M_1, M_2, \ldots, M_i)$ for constants depending only on
the choice of the numbers $M_1, M_2, \ldots, M_i$. The concrete numerical value of this expression may change,
i.e. become bigger, from one line of reasoning to the next one, but at every stage it will be clear that
an explicit bound for the respective meaning of the constant $c(M_1, M_2, \ldots, M_i)$ could be given, at least
in principle. In fact, we shall always have that the constants $c(M_1, M_2, \ldots, M_i)$ used in the sequel are
dominated by a polynomial in the variables $M_1, M_2, \ldots, M_i$.

For $1 \le k_1 \le m_1, \ldots, 1 \le k_n \le m_n$, let

\[
I_{k_1,\ldots,k_n} = \Bigl[\frac{k_1-1}{M_1} + \frac{k_2-1}{M_2} + \cdots + \frac{k_n-1}{M_n},\ \frac{k_1-1}{M_1} + \frac{k_2-1}{M_2} + \cdots + \frac{k_n}{M_n}\Bigr).
\]

For $x \in I_{1,\ldots,1}$ and $j \in \{0, \ldots, M_n\}$ we define, similarly as in (33), $\varphi^n(x) = 0$ and

\[
\varphi^n(T_{\alpha_n}^j(x)) = \#\{i \in \{0, \ldots, j-1\} : T_{\alpha_n}^i(x) \in L^n\}
- \#\{i \in \{0, \ldots, j-1\} : T_{\alpha_n}^i(x) \in R^n\},
\tag{47}
\]

where $\alpha_n = \alpha_{n-1} + \frac{1}{M_n}$ and $M_n = M_{n-1} m_n$. We also let $\psi^n(x) = 1 - \varphi^n(x)$, for $x \in [0,1)$.

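Lemma 3.4 (Oscillations of $\varphi^n$). For given $M_1, \ldots, M_{n-1}$ there is a constant $c(M_1, \ldots, M_{n-1})$
depending only on $M_1, \ldots, M_{n-1}$, such that for all $m_n$ as above we have

(i) $|\varphi^n(x) - \varphi^n(x \oplus \frac{1}{M_n})| \le c(M_1, \ldots, M_{n-1})$,

(ii) for each $1 \le k_1', k_1'' \le M_1, \ldots, 1 \le k_{n-1}', k_{n-1}'' \le m_{n-1}$,

\[
\varphi^n\big|_{I_{k_1',\ldots,k_{n-1}',\,\frac{m_n+1}{2}}} - \varphi^n\big|_{I_{k_1'',\ldots,k_{n-1}'',\,1}} \ge \frac{m_n}{2M_{n-1}} - c(M_1, \ldots, M_{n-1}),
\]

(iii) for each $1 \le k_1', k_1'' \le M_1, \ldots, 1 \le k_{n-1}', k_{n-1}'' \le m_{n-1}$, and $1 \le k_n', k_n'' \le m_n$, with
$\min\{k_n', m_n - k_n'\} \le M_{n-1}$ and $\min\{k_n'', m_n - k_n''\} \le M_{n-1}$ we have

\[
\bigl|\varphi^n\big|_{I_{k_1',\ldots,k_n'}} - \varphi^n\big|_{I_{k_1'',\ldots,k_n''}}\bigr| \le c(M_1, \ldots, M_{n-1}).
\tag{48}
\]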

Proof. We may and do assume that $m_n > 5M_{n-1}$.

• Proof of (i). We have $T_{\alpha_n}(x) = T_{\alpha_{n-1}}(T_{1/M_n}(x))$ so that

\[
T_{\alpha_n}^{M_{n-1}}(x) = x \oplus \frac{M_{n-1}}{M_n} = x \oplus \frac{1}{m_n},
\tag{49}
\]

in perfect analogy to (34). As regards the analogue to (35) things now are somewhat more
complicated. First note that there is a unique number $1 \le q_{n-1} \le M_{n-1} - 1$ such that

\[
T_{\alpha_{n-1}}^{q_{n-1}}(x) = x \ominus \frac{1}{M_{n-1}}, \qquad x \in [0,1).
\tag{50}
\]

Indeed, by Lemma 3.2, when $q_{n-1}$ runs through $\{1, \ldots, M_{n-1} - 1\}$, the left hand side assumes
the values $x \ominus \frac{l_{n-1}}{M_{n-1}}$, where $l_{n-1}$ also runs through $\{1, \ldots, M_{n-1} - 1\}$.

Claim: Letting $r_n = \lfloor \frac{m_n}{M_{n-1}} \rfloor$, the integer part of $\frac{m_n}{M_{n-1}}$, and taking $q_{n-1}$ as in (50), we have

\[
T_{\alpha_n}^{r_n M_{n-1} + q_{n-1}}(x) = x \oplus \frac{d_{n-1}}{M_n},
\]

where $|d_{n-1}| < M_{n-1}$.

Indeed, write $m_n$ as $m_n = r_n M_{n-1} + e_{n-1}$, for some $1 \le e_{n-1} < M_{n-1}$, to obtain

\[
\begin{aligned}
T_{\alpha_n}^{r_n M_{n-1} + q_{n-1}}(x)
&= \bigl(T_{\alpha_n}^{M_{n-1}}\bigr)^{r_n} \circ T_{\alpha_{n-1}}^{q_{n-1}} \circ T_{1/M_n}^{q_{n-1}}(x) \\
&= x \oplus r_n \frac{M_{n-1}}{M_n} \ominus \frac{1}{M_{n-1}} \oplus \frac{q_{n-1}}{M_n} \\
&= x \oplus \frac{m_n}{M_n} \ominus \frac{e_{n-1}}{M_n} \ominus \frac{1}{M_{n-1}} \oplus \frac{q_{n-1}}{M_n} \\
&= x \oplus \frac{q_{n-1} - e_{n-1}}{M_n} = x \oplus \frac{d_{n-1}}{M_n},
\end{aligned}
\]

which proves the claim.

Define $s^{(1)}_{n-1} = q_{n-1}$ if $d_{n-1} = q_{n-1} - e_{n-1} > 0$, and $s^{(1)}_{n-1} = q_{n-1} + M_{n-1}$ otherwise, to obtain
by (49) and (50) that

\[
T_{\alpha_n}^{r_n M_{n-1} + s^{(1)}_{n-1}}(x) = x \oplus \frac{l^{(1)}_{n-1}}{M_n},
\]

for some $l^{(1)}_{n-1} \in \{1, \ldots, M_{n-1}\}$. We also deduce from (49) that $l^{(1)}_{n-1}$ must actually be in
$\{1, \ldots, M_{n-1} - 1\}$.

Repeat the above argument to find $s^{(2)}_{n-1}$ with $-2M_{n-1} \le s^{(2)}_{n-1} \le 2M_{n-1}$ such that

\[
T_{\alpha_n}^{2 r_n M_{n-1} + s^{(2)}_{n-1}}(x) = x \oplus \frac{l^{(2)}_{n-1}}{M_n},
\]

for some $l^{(2)}_{n-1} \in \{1, \ldots, M_{n-1} - 1\}$. Continuing in the same way, we find numbers $s^{(j)}_{n-1}$, for
$j = 1, 2, \ldots, M_{n-1} - 1$, verifying $-jM_{n-1} \le s^{(j)}_{n-1} \le jM_{n-1}$ such that

\[
T_{\alpha_n}^{j r_n M_{n-1} + s^{(j)}_{n-1}}(x) = x \oplus \frac{l^{(j)}_{n-1}}{M_n},
\tag{51}
\]

for some $l^{(j)}_{n-1} \in \{1, \ldots, M_{n-1} - 1\}$. Note that, under the assumption $m_n \gg M_{n-1}$ so that
$r_n \gg M_{n-1}$, the elements in (51) are all different. Therefore $(l^{(j)}_{n-1})_{j=1}^{M_{n-1}-1}$ runs through all
elements of $\{1, \ldots, M_{n-1} - 1\}$ when $j$ runs through $\{1, \ldots, M_{n-1} - 1\}$; in particular there
must be some $j_0$ such that

\[
T_{\alpha_n}^{j_0 r_n M_{n-1} + s^{(j_0)}_{n-1}}(x) = x \oplus \frac{1}{M_n},
\]

in analogy to (36).

Now observe that there is a constant $c(M_1, \ldots, M_{n-1})$, depending only on $M_1, \ldots, M_{n-1}$,
such that, for $x \in [0,1)$, the difference of the number of visits of the orbit $(T_{\alpha_n}^i(x))_{i=1}^{r_n M_{n-1} + q_{n-1}}$
to $L^n$ and $R^n$ is bounded in absolute value by the constant $c(M_1, \ldots, M_{n-1})$. The argument
is analogous to the corresponding one in the proof of the claim which is part of the proof of
Lemma 3.3-(i), and therefore skipped.

The numbers $j_0$ as well as $s^{(j_0)}_{n-1}$ are bounded in absolute value by $M_{n-1}^2$ so that the difference
of the visits of the orbits $(T_{\alpha_n}^i(x))_{i=0}^{j_0 r_n M_{n-1} + s^{(j_0)}_{n-1} - 1}$ to $L^n$ and $R^n$ is bounded in absolute
value by some constant $c(M_1, \ldots, M_{n-1})$. This finishes the proof of assertion (i).
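The identities (49), (50) and the claim are statements about rational rotations and can be replayed exactly. The following sketch is an illustration under stated assumptions: `check_claim` is a hypothetical helper, and the sequences $(5, 11)$ and $(5, 11, 331)$ are chosen only for the example; they are not claimed to meet every condition of Lemma 3.2.

```python
from fractions import Fraction

def check_claim(ms):
    """For ms = [m_1, ..., m_n] return (q_{n-1}, e_{n-1}, d_{n-1})."""
    n = len(ms)
    M = [1]
    for m in ms:
        M.append(M[-1] * m)                      # M[j] = m_1 * ... * m_j

    def alpha(j):
        return sum(Fraction(1, M[i]) for i in range(1, j + 1))

    def shift(steps, a):
        return (steps * a) % 1                   # T_a^steps is x -> x (+) shift

    # (49): M_{n-1} steps of T_{alpha_n} shift by 1/m_n.
    assert shift(M[n - 1], alpha(n)) == Fraction(1, ms[-1])
    # (50): the unique q with T_{alpha_{n-1}}^q (x) = x (-) 1/M_{n-1}.
    target = 1 - Fraction(1, M[n - 1])
    (q,) = [k for k in range(1, M[n - 1]) if shift(k, alpha(n - 1)) == target]
    # Claim: r_n * M_{n-1} + q steps of T_{alpha_n} shift by d/M_n with d = q - e.
    r, e = divmod(ms[-1], M[n - 1])
    d = q - e
    assert shift(r * M[n - 1] + q, alpha(n)) == Fraction(d, M[n]) % 1
    assert abs(d) < M[n - 1]
    return q, e, d
```

The tuple unpacking `(q,) = [...]` fails unless exactly one admissible $q_{n-1}$ exists, which mirrors the uniqueness statement preceding (50).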

• Proof of (ii). Suppose first, as in the proof of Lemma 3.3-(ii), that (k[, . . . , k' n _ 1 ) = 
{k'{ , . . . , fc^_i) =: (fci, • • • , k n -i). For x G 4i,...,li„_i,i we have that each of the orbits 

(^"-^(i))^- 1 " 1 , for j = 0, . . . , Lal^rJ ~ 1 visits Ul one time more often than Rn - 
Hence 

Noting that 

T<*n " 1 (a;) = a; © [jm^I J m,/ 



and 

m n +l _ I m„ I M„_i ^ Mn-i 
2M„ L2M„_iJ M„ — M„ ' 

we obtain (ii) by using assertion (i), and possibly passing to a bigger constant $c(M_1, \ldots, M_{n-1})$.
Finally the passage to general $(k_1', \ldots, k_{n-1}')$ and $(k_1'', \ldots, k_{n-1}'')$ is done again, similarly as
in the proof of Lemma 3.3, by repeated application of (i) and by passing once more to a
bigger constant $c(M_1, \ldots, M_{n-1})$.

• Proof of (iii). Fix $1 \le k_1', k_1'' \le M_1, \ldots, 1 \le k_{n-1}', k_{n-1}'' \le m_{n-1}$ and $1 \le k_n', k_n'' \le m_n$
as above. Suppose, e.g., $k_n' \le M_{n-1}$ and $m_n - k_n'' \le M_{n-1}$, the other three cases being
similar. Denote by $(k_1''', \ldots, k_{n-1}''')$ the index so that $I_{k_1''',\ldots,k_{n-1}'''} = I_{k_1'',\ldots,k_{n-1}''} \oplus \frac{1}{M_{n-1}}$,
i.e. $I_{k_1''',\ldots,k_{n-1}'''}$ is the right neighbour of $I_{k_1'',\ldots,k_{n-1}''}$. Now find $0 \le q_{n-1} < M_{n-1}$
such that $T_{\alpha_{n-1}}^{q_{n-1}}$ maps $I_{k_1',\ldots,k_{n-1}'}$ onto $I_{k_1''',\ldots,k_{n-1}'''}$. Hence $T_{\alpha_n}^{q_{n-1}}$ maps $I_{k_1',\ldots,k_{n-1}',k_n'}$ onto
$I_{k_1''',\ldots,k_{n-1}''',\,k_n'+q_{n-1}}$.

Finally note that the distance from the latter interval to $I_{k_1'',\ldots,k_{n-1}'',k_n''}$ is bounded by
$(2M_{n-1} + M_{n-1})\frac{1}{M_n}$. Hence we obtain (48) by applying $2M_{n-1} + M_{n-1}$ times assertion
(i) and using $0 \le q_{n-1} < M_{n-1}$. □

After this preparation we are ready for the inductive step from $n-1$ to $n$. Suppose that
the following inductive hypotheses are satisfied, for $1 \le l \le n-1$, functions $\tau_l : [0,1) \to
\{-M_l+1, \ldots, M_l-1\}$ and index sets $J_l^g$, $J_l^s$ contained in $\{(k_1, \ldots, k_l) : 1 \le k_1 \le m_1, \ldots, 1 \le
k_l \le m_l\}$.

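(i) The measure preserving bijection $T_{\alpha_{n-1}}^{(\tau_{n-1})} : [0,1) \to [0,1)$ maps the intervals $I_{k_1,\ldots,k_l}$,
for $1 \le l \le n-1$, and $1 \le k_1 \le m_1, \ldots, 1 \le k_l \le m_l$, onto the intervals $T_{\alpha_l}^{(\tau_l)}(I_{k_1,\ldots,k_l})$.
It induces a permutation of the intervals $I_{k_1,\ldots,k_{n-1}}$, where $1 \le k_1 \le m_1, \ldots, 1 \le
k_{n-1} \le m_{n-1}$.

(ii) When $\tau_{n-1}(x) > 0$, we have

\[
T_{\alpha_{n-1}}^i(x) \notin I^{n-1}_{\mathrm{middle}}, \qquad i = 0, \ldots, \tau_{n-1}(x),
\tag{52}
\]

and, when $\tau_{n-1}(x) < 0$, we have

\[
T_{\alpha_{n-1}}^i(x) \notin I^{n-1}_{\mathrm{middle}}, \qquad i = \tau_{n-1}(x), \ldots, 0.
\tag{53}
\]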

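(iii) There is a set of "good" indices $J_{n-1}^g \subseteq \{(k_1, \ldots, k_{n-1}) : 1 \le k_1 \le m_1, \ldots, 1 \le k_{n-1} \le m_{n-1}\}$. For
$(k_1, \ldots, k_{n-2}) \in J_{n-2}^g$ we have that $(k_1, \ldots, k_{n-2}, k_{n-1}) \in J_{n-1}^g$ as well as

\[
\mu\bigl[I_{k_1,\ldots,k_{n-2}} \cap \{\tau_{n-2} \ne \tau_{n-1}\}\bigr] \le \frac{M_{n-2}}{m_{n-1}}\,\mu\bigl[I_{k_1,\ldots,k_{n-2}}\bigr],
\tag{54}
\]

and

\[
\sum_{(k_1,\ldots,k_{n-2}) \in J_{n-2}^g} \int_{I_{k_1,\ldots,k_{n-2}}}
\Bigl| \bigl[\varphi^{n-2}(x) - \varphi^{n-2}\bigl(T_{\alpha_{n-2}}^{(\tau_{n-2})}(x)\bigr)\bigr]
- \bigl[\varphi^{n-1}(x) - \varphi^{n-1}\bigl(T_{\alpha_{n-1}}^{(\tau_{n-1})}(x)\bigr)\bigr] \Bigr|\,dx
\le \frac{c(M_1, \ldots, M_{n-2})}{m_{n-1}}.
\tag{55}
\]

(iv) There is a set of "singular" indices $J_{n-1}^s \subseteq \{(k_1, \ldots, k_{n-1}) : 1 \le k_1 \le m_1, \ldots, 1 \le
k_{n-1} \le m_{n-1}\}$, disjoint from $J_{n-1}^g$, such that $J_{n-1}^s$ consists of less than $2M_{n-1}^2$ many
elements and such that

\[
\varphi^{n-1}(x) - \varphi^{n-1}\bigl(T_{\alpha_{n-1}}^{(\tau_{n-1})}(x)\bigr) < 0, \qquad x \in I_{k_1,\ldots,k_{n-1}}
\text{ and } (k_1, \ldots, k_{n-1}) \in J_{n-1}^s,
\tag{56}
\]

and

\[
\sum_{(k_1,\ldots,k_{n-1}) \in J_{n-1}^s} \int_{I_{k_1,\ldots,k_{n-1}}} \bigl[\varphi^{n-1}(x) - \varphi^{n-1}\bigl(T_{\alpha_{n-1}}^{(\tau_{n-1})}(x)\bigr)\bigr]\,dx
\le -1 + \frac{3}{m_1} + \frac{c(M_1)}{m_2} + \cdots + \frac{c(M_1, \ldots, M_{n-2})}{m_{n-1}},
\tag{57}
\]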

where $c(\cdot)$ are constants depending only on $(\cdot)$.

(v) On the middle interval $I^{n-1}_{\mathrm{middle}} = I_{\frac{m_1+1}{2},\ldots,\frac{m_{n-1}+1}{2}}$ we have $\tau_1 = \tau_2 = \cdots = \tau_{n-1} = 0$, and $I^{n-1}_{\mathrm{middle}}$
together with the intervals $(I_{k_1,\ldots,k_{n-1}})_{(k_1,\ldots,k_{n-1}) \in J_{n-1}^s \cup J_{n-1}^g}$ form a partition of $[0,1)$.

We have to define $\tau_n$ as well as $J_n^g$ and $J_n^s$ so that the above list is satisfied with $n-1$
replaced by $n$.

Let us illustrate graphically some features of this construction, namely the fractal
structure of the singular set and the resulting quasi-cost.

Fig. 6. The fractal structure of the "singular" set (levels $n = 1, 2, 3, 4$ over the intervals $I_1, \ldots, I_{M_1}$).

For the sake of simplicity of the drawing, the red area which represents the singular set is
thicker than it should be. Note also that the effective singular set is not perfectly balanced.

Fig. 7. Shape of the quasi-cost $\varphi^n + \psi^n \circ T_{\alpha_n}^{(\tau_n)}$.

The strips on this graphic representation symbolize the oscillations of the function $\varphi^n + \psi^n \circ T_{\alpha_n}^{(\tau_n)}$.
On the "singular" set, this function achieves values of order $-M_n/M_{n-1}^2$. Of course,
the effective singular set is much more fragmented than it appears on this figure.

We start with a "good" interval $I_{k_1,\ldots,k_{n-1}}$, i.e. $(k_1, \ldots, k_{n-1}) \in J_{n-1}^g$, and simply write $\tau$
for $\tau_{n-1}|_{I_{k_1,\ldots,k_{n-1}}}$. If $\tau > 0$, define $J^{k_1,\ldots,k_{n-1},c}$, where $c$ stands for "change", as $\{m_n - \tau + 1,
\ldots, m_n\}$. This set consists of those indices $k_n$ such that the interval $I_{k_1,\ldots,k_n}$ is not mapped
into $T_{\alpha_{n-1}}^{(\tau_{n-1})}(I_{k_1,\ldots,k_{n-1}})$ under $T_{\alpha_n}^{(\tau_{n-1})}$. If $\tau < 0$, we define $J^{k_1,\ldots,k_{n-1},c}$ as $\{1, \ldots, -\tau\}$.
The complement $\{1, \ldots, m_n\} \setminus J^{k_1,\ldots,k_{n-1},c}$ is denoted by $J^{k_1,\ldots,k_{n-1},u}$, where $u$ stands for
"unchanged".

Define $\tau_n := \tau_{n-1} = \tau$ on the intervals $I_{k_1,\ldots,k_{n-1},k_n}$, for $k_n \in J^{k_1,\ldots,k_{n-1},u}$. For $x$ in one
of those intervals we have by (52), (53) and (47) that

\[
\varphi^n(x) - \varphi^n\bigl(T_{\alpha_n}^{(\tau_n)}(x)\bigr) = \varphi^{n-1}(x) - \varphi^{n-1}\bigl(T_{\alpha_{n-1}}^{(\tau_{n-1})}(x)\bigr),
\]

which yields (54) with $n-1$ replaced by $n$.
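The split into "change" and "unchanged" indices is elementary index arithmetic: $\tau$ steps of $T_{\alpha_n}$ move the last index $k_n$ to $k_n + \tau$ as long as this stays inside $\{1, \ldots, m_n\}$. The helper names below are assumptions introduced for this sketch.

```python
def change_indices(tau, m_n):
    """Indices k_n in {1, ..., m_n} for which k_n + tau leaves {1, ..., m_n}."""
    if tau > 0:
        return list(range(m_n - tau + 1, m_n + 1))
    if tau < 0:
        return list(range(1, -tau + 1))
    return []

def unchanged_indices(tau, m_n):
    """Complement of the change indices in {1, ..., m_n}."""
    changed = set(change_indices(tau, m_n))
    return [k for k in range(1, m_n + 1) if k not in changed]
```

For $n = 2$ and a good interval on the left ($\tau = -1$) the only change index is $k_2 = 1$, and on the right ($\tau = 1$) it is $k_2 = m_2$, in line with the discussion of (42) and (43).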

On the remaining intervals $I_{k_1,\ldots,k_n}$ with $k_n \in J^{k_1,\ldots,k_{n-1},c}$ we define $\tau_n$ such that it takes
constant values in $\{-M_n+1, \ldots, M_n-1\}$ on each of these intervals, such that (52) (resp. (53))
is satisfied, and such that these intervals $I_{k_1,\ldots,k_n}$ are mapped onto the "remaining gaps" in
$T_{\alpha_{n-1}}^{(\tau_{n-1})}(I_{k_1,\ldots,k_{n-1}})$.

The crucial observation is that the intervals $I_{k_1,\ldots,k_{n-1},k_n}$ where we have $\tau_n \ne \tau_{n-1}$,
i.e. where $k_n \in J^{k_1,\ldots,k_{n-1},c}$, are all on the "boundary" of $I_{k_1,\ldots,k_{n-1}}$: they are the $|\tau|$ many
intervals on the left or right end of $I_{k_1,\ldots,k_{n-1}}$, depending on the sign of $\tau$. Similarly, the
"remaining gaps" in $T_{\alpha_{n-1}}^{(\tau_{n-1})}(I_{k_1,\ldots,k_{n-1}})$ are the $|\tau|$ many intervals on the opposite end of
$T_{\alpha_{n-1}}^{(\tau_{n-1})}(I_{k_1,\ldots,k_{n-1}})$. Hence we may apply assertion (iii) of Lemma 3.4 to conclude that

\[
\bigl|\varphi^n(x) - \varphi^n\bigl(T_{\alpha_n}^{(\tau_n)}(x)\bigr)\bigr| \le c(M_1, \ldots, M_{n-1}),
\]

for those $x \in I_{k_1,\ldots,k_{n-1}}$ where $\tau_n(x) \ne \tau_{n-1}(x)$. Summing over all "good intervals" $I_{k_1,\ldots,k_{n-1}}$,
where $(k_1, \ldots, k_{n-1}) \in J_{n-1}^g$, we conclude that the contribution to (55), with $n-1$ replaced
by $n$, is controlled by the following factors: $M_{n-1}$, which is a bound for the number of
elements in $J_{n-1}^g$, times $M_{n-1}$, which is a bound for $|\tau|$, times $\frac{1}{M_n}$, which is the length of the
intervals $I_{k_1,\ldots,k_n}$, times the above found constant $c(M_1, \ldots, M_{n-1})$. In total, this implies
the estimate (55), with $n-1$ replaced by $n$.

We now turn to item (iv), i.e. to the "singular" indices: fix hi, . . . , k n -i G Jn-i and let 
A(f denote the constant 

A V := V n -\T^(x)) V n -\x), x e Jfe,...,^.,, 

and again r the constant t„_ii J; _ fc , so that < Aip < \t\ < M„_i. 
Similarly as for the case n — 2 define 

Tfei,...,fc n _i,5,; _ rj,; jZ i 1 m„-l 1 7fei,...,fc n -i,g,r _ r m n +l i,r\ 

J — ISi K ra "T" 1 • • • J 2 J' J — I 2 ! ■ • ■ J Nil- 

Here fc^ is the largest number such that, for the orbit (T* n (x))l^' pMn ~ 1 1 and for x G 
^fei,...,fc n _i,fe^, all its members lie in the right half of the respective intervals Ik' 1 ,...,k' n _ 1 - In 
fact, we get as in the step n = 2 that k r n = m n — (r + A<^M n _i). 

Similarly fc^ is the smallest number such that, for the orbit (T£ {x)) T i= ^ ipMn ~ 1+1 and for 
x G Ik 1 k„-i,k> 7 au its members are in the left half of the respective intervals 7^/ k' 1 - 
We get k l n = t - A</?M„_i + 1. 
Now we define t„ as 

r„(x) =T + A^M n _i, for x e Ik 1 ,...,h n - 1 ,k n ,k n G J fe i.-- fe »-i.5. r , 

and 

r„(x) = r - A<^M„_i, for x G I fcl ,...,fc„_ 1 ,fc„,fc„ G jfci.-.^-i.fl.'. 
Similarly as in (44) at step n = 2, we get for k n G jki,-,k n -i,g .- jk 1 ,...,k n - 1 ,g,l uj k 1 ,...,k n - 1 ,g,r ) 



2G 



and x G Iki,...,kn-i,k n that 

<p n {x)-<p n {T<£\x)) 

= [<p n (x) <p n {T<£-J{x))] + [<p n {T<£-J{x)) <p n {T<£\x))] 

= [<p"-\x) - <p n -\T^\x))\ + [<p n {T<r r J(x)) - <p n {Tfr\x))] 

= -A(p + Aip = 0. 

We still have to deal with the "singular" indices 

jk^K.us := {!,..., mn } \ jfci.-.*»-i.fl = {!,..., k l n -l}U {¥ n + 1, . . . , m„}, 

which consists of 2A<^M„_i many indices. This number is bounded by 2M^_ 1 as Aip < 
\t\ < M„_i. These intervals have to be mapped onto the "remaining gaps" in the interval 

Ta"Si\lk 1 ....,k rl -i)- Make the crucial observation that, while the intervals Ik lt ...,k n -i,k n , f° r 
k n G jki,—,k n - i,s^ are the boundary of Ik lt . ..,fe n _i , the "remaining gaps" are in the middle 

of the interval Ti^ 1 '(// Sli ... j / Sn _ 1 ). This fact is analogous to the situation for n = 1 and 
n = 2. 

Now define r„ on the intervals Ik 1 ,...,k n - 1 ,k n for fc„ G J fe i.— .fcn.-i,« ) m g^jj a wa y that 

T<S£ n ^ maps these intervals onto the "remaining gaps" in ri^ 1 ^(7fe li ... j / Sn _ 1 ) and such that 
t„ is constant on each of these intervals, takes values in {—M n + 1, . . . , M n — 1} and such 
that (52) (resp. (53)) is satisfied with n — 1 replaced by n. Applying Lemma 3.4, assertion 
(ii) as well as 2(M„_i + l)|r| many times assertion (i) we obtain, for x G Iki,...,k n -i,k n an d 

k n G jfel.-,fcn-l, S) 

p n (aO - <p n {T<£\x)) < -gfc + c(M!, . . . , M n _!). 

Assuming that m„ is sufficiently large as compared to M n _i we have that the right hand 
side is negative. 

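Keeping in mind that there are $2\Delta\varphi M_{n-1}$ many indices in $J^{k_1,\ldots,k_{n-1},s}$, we may estimate
the "singular mass" on the interval $I_{k_1,\ldots,k_{n-1}}$ by

\[
\sum_{k_n \in J^{k_1,\ldots,k_{n-1},s}} \int_{I_{k_1,\ldots,k_{n-1},k_n}} \bigl[\varphi^n(x) - \varphi^n\bigl(T_{\alpha_n}^{(\tau_n)}(x)\bigr)\bigr]\,dx
\le 2\Delta\varphi M_{n-1} \Bigl[-\frac{m_n}{2M_{n-1}} + c(M_1, \ldots, M_{n-1})\Bigr] \frac{1}{M_n}
\le \frac{1}{M_{n-1}} \Bigl[-\Delta\varphi + \frac{c(M_1, \ldots, M_{n-1})}{m_n}\Bigr].
\tag{58}
\]

We have by the inductive hypothesis that

\[
\sum_{(k_1,\ldots,k_{n-1}) \in J_{n-1}^s} \int_{I_{k_1,\ldots,k_{n-1}}} \bigl[\varphi^{n-1}(x) - \varphi^{n-1}\bigl(T_{\alpha_{n-1}}^{(\tau_{n-1})}(x)\bigr)\bigr]\,dx
\le -1 + \frac{3}{m_1} + \frac{c(M_1)}{m_2} + \cdots + \frac{c(M_1, \ldots, M_{n-2})}{m_{n-1}},
\]

or, writing now $\Delta\varphi_{k_1,\ldots,k_{n-1}}$ for the above value of $\Delta\varphi$ on the interval $I_{k_1,\ldots,k_{n-1}}$,

\[
-\frac{1}{M_{n-1}} \sum_{(k_1,\ldots,k_{n-1}) \in J_{n-1}^s} \Delta\varphi_{k_1,\ldots,k_{n-1}}
\le -1 + \frac{3}{m_1} + \frac{c(M_1)}{m_2} + \cdots + \frac{c(M_1, \ldots, M_{n-2})}{m_{n-1}}.
\]

Letting $J_n^s := \bigcup_{(k_1,\ldots,k_{n-1}) \in J_{n-1}^s} \{(k_1, \ldots, k_{n-1}, k_n) : k_n \in J^{k_1,\ldots,k_{n-1},s}\}$ we obtain from (58)

\[
\begin{aligned}
\sum_{(k_1,\ldots,k_n) \in J_n^s} \int_{I_{k_1,\ldots,k_n}} \bigl[\varphi^n(x) - \varphi^n\bigl(T_{\alpha_n}^{(\tau_n)}(x)\bigr)\bigr]\,dx
&\le \Bigl(-1 + \frac{3}{m_1} + \cdots + \frac{c(M_1, \ldots, M_{n-2})}{m_{n-1}}\Bigr) + \frac{c(M_1, \ldots, M_{n-1})}{m_n} \\
&= -1 + \frac{3}{m_1} + \cdots + \frac{c(M_1, \ldots, M_{n-2})}{m_{n-1}} + \frac{c(M_1, \ldots, M_{n-1})}{m_n},
\end{aligned}
\]

where we may have increased the constant $c(M_1, \ldots, M_{n-1})$ in the last line. This concludes
the inductive step.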


Construction of the Example: Let $\alpha = \lim_{n\to\infty} \alpha_n$ so that $T_\alpha = \lim_{n\to\infty} T_{\alpha_n}$ is the shift by
the irrational number $\alpha$.

The sequence $(\tau_n)_{n=1}^\infty$ of functions $\tau_n : [0,1) \to \mathbb{Z}$ converges, by (54), almost surely to a
$\mathbb{Z}$-valued function $\tau = \lim_{n\to\infty} \tau_n$. Hence the maps $T_{\alpha_n}^{(\tau_n)}$ converge almost surely to a
map

\[
T_\alpha^{(\tau)} : [0,1) \to [0,1), \qquad x \mapsto T_\alpha^{(\tau)}(x) = T_\alpha^{\tau(x)}(x).
\]

Using the fact that each $T_{\alpha_n}^{(\tau_n)}$ is a measure preserving almost sure bijection on $[0,1)$, it is
straightforward to check that $T_\alpha^{(\tau)}$ is so too.

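Letting $\Gamma_\tau = \{(x, T_\alpha^{(\tau)}(x)),\ x \in [0,1)\}$ in analogy to the notations $\Gamma_0 = \{(x,x),\ x \in
[0,1)\}$ and $\Gamma_1 = \{(x, T_\alpha(x)),\ x \in [0,1)\}$, we define

\[
c(x,y) =
\begin{cases}
h_+(x,y), & \text{if } (x,y) \in \Gamma_0 \cup \Gamma_1 \cup \Gamma_\tau, \\
\infty, & \text{otherwise,}
\end{cases}
\]

where $h$ is defined in (15) above. From this definition we deduce the almost sure identity,
for $\tau(x) > 0$,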

h(x, T^\x)) = #{» g {0, . . . , r(x) - 1} : T a (x) G [0, §)} 

- #{t G {0, . . . , t(x) - 1} : T l a {x) G [i, 1)} + 1 (59) 

n— >oo 

a similar formula holding true for t(x) < 0. 

As regards the Borel functions $(\varphi_n, \psi_n)_{n=1}^\infty$ announced in (17), (18) and (19) above, we
need to slightly modify the functions $(\varphi^n, \psi^n)_{n=1}^\infty$ constructed in the above induction to
make sure that they satisfy the inequality

$$\varphi_n(x) + \psi_n(y) \le c(x,y), \quad \text{for } x \in X,\ y \in Y. \tag{60}$$

As $c = \infty$ outside of $\Gamma_0 \cup \Gamma_1 \cup \Gamma_\tau$ it is sufficient to make sure that the following inequalities
hold true almost surely, for $x \in [0,1)$:

(0) $\varphi_n(x) + \psi_n(x) \le c(x,x) = 1$,

(1) $\varphi_n(x) + \psi_n(T_\alpha(x)) \le c(x, T_\alpha(x)) = \begin{cases} 2, & \text{for } x \in [0, \tfrac12), \\ 0, & \text{for } x \in [\tfrac12, 1), \end{cases}$

($\tau$) $\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x)) \le c(x, T_\alpha^{(\tau)}(x))$.
The above constructed $(\varphi^n, \psi^n)_{n=1}^\infty$ only satisfy condition (0). We still have to pass from $\varphi^n$
to a smaller function $\varphi_n$ - while leaving $\psi_n := \psi^n$ unchanged - to satisfy (1) and ($\tau$) too.
Let

$$\varphi_n(x) := \varphi^n(x) - [\varphi^n(x) + \psi^n(T_\alpha(x)) - c(x, T_\alpha(x))]_+ - [\varphi^n(x) + \psi^n(T_\alpha^{(\tau)}(x)) - c(x, T_\alpha^{(\tau)}(x))]_+. \tag{61}$$

Clearly $\varphi_n \le \varphi^n$ and the functions $(\varphi_n, \psi_n)$ satisfy the inequality (60).
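The mechanism behind (61), subtracting from $\varphi^n$ the positive part of each violated constraint, can be tried out on a finite toy model. The sketch below is an assumption-laden illustration, not the construction of the paper: the space is $\{0,\dots,N-1\}$, the maps are hypothetical permutations given as lists, and the cost is an arbitrary function `cost(x, y)`.

```python
def positive_part(t):
    """[t]_+ = max(t, 0)."""
    return max(t, 0.0)

def correct_phi(phi, psi, maps, cost):
    """Finite analogue of (61): subtract from phi(x), for every map T,
    the excess [phi(x) + psi(T(x)) - cost(x, T(x))]_+ ."""
    corrected = []
    for x, value in enumerate(phi):
        excess = sum(positive_part(value + psi[T[x]] - cost(x, T[x])) for T in maps)
        corrected.append(value - excess)
    return corrected

# Hypothetical data on a four-point space.
phi = [1.0, 0.5, 2.0, 0.0]
psi = [0.0, 0.5, -1.0, 1.0]
shift_one = [1, 2, 3, 0]   # x -> x + 1 mod 4
shift_two = [2, 3, 0, 1]   # x -> x + 2 mod 4
maps = [shift_one, shift_two]

def cost(x, y):
    return 1.0             # constant toy cost

phi_corrected = correct_phi(phi, psi, maps, cost)
print(phi_corrected)
```

Since every subtracted term is non-negative, the corrected function is dominated by the original one, and each single constraint is satisfied because its own excess has been removed; this is the same two-line argument that gives (60) from (61).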

We have to show that the functions $\varphi_n$ defined in (61) satisfy that $\varphi^n - \varphi_n$ is small in the
norm of $L^1(\mu)$, as $n \to \infty$, that is

$$\lim_{n\to\infty} \int_{[0,1)} (\varphi^n(x) - \varphi_n(x))\, dx = 0, \tag{62}$$

provided that $(m_n)_{n=1}^\infty$ increases sufficiently fast to infinity.
We may estimate the first correction term in (61) by

$$[\varphi^n(x) + \psi^n(T_\alpha(x)) - c(x, T_\alpha(x))]_+ \le [\psi^n(T_\alpha(x)) - \psi^n(T_{\alpha_n}(x))]_+ + [\varphi^n(x) + \psi^n(T_{\alpha_n}(x)) - c(x, T_\alpha(x))]_+.$$

The second term above is dominated by $1_{I^n_{\mathrm{middle}}}$, which is harmless as $\|1_{I^n_{\mathrm{middle}}}\|_{L^1(\mu)} = \frac{1}{M_n}$.
As regards the first term, note that $T_\alpha(x) \ominus T_{\alpha_n}(x) = \alpha - \alpha_n = \sum_{j=n+1}^\infty \frac{1}{M_j}$, which we may
bound by $\frac{2}{M_{n+1}}$ by assuming that $(m_n)_{n=1}^\infty$ increases sufficiently fast to infinity. As $\psi^n$ is
constant on each of the $M_n$ many intervals $I_{k_1,\dots,k_n}$ we get

$$\mu\{x \in [0,1) : \psi^n(T_\alpha(x)) \ne \psi^n(T_{\alpha_n}(x))\} \le M_n(\alpha - \alpha_n) \le \frac{2}{m_{n+1}}.$$

On this set we may estimate, using only the obvious bound $|\psi^n(x)| \le M_n$, that

$$|\psi^n(T_\alpha(x)) - \psi^n(T_{\alpha_n}(x))| \le 2M_n, \quad x \in [0,1),$$

to obtain

$$\|\psi^n(T_\alpha(x)) - \psi^n(T_{\alpha_n}(x))\|_{L^1(\mu)} \le \frac{4M_n}{m_{n+1}}.$$

Hence for $(m_n)_{n=1}^\infty$ growing sufficiently fast to infinity, the first correction term in (61) is
also small in $L^1$-norm.
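The two elementary bounds used above can be checked with exact rational arithmetic. The sketch assumes, as the estimate suggests, that $M_n = m_1 \cdots m_n$ and that $\alpha - \alpha_n$ is the tail $\sum_{j>n} 1/M_j$; the list `primes` is a hypothetical choice of the $m_n$, and only a truncated tail is computed.

```python
from fractions import Fraction
from itertools import accumulate
from operator import mul

# Hypothetical fast-growing primes m_1, ..., m_5 (illustration only).
primes = [3, 7, 31, 127, 8191]
# M[n - 1] holds M_n = m_1 * ... * m_n.
M = list(accumulate(primes, mul))

def truncated_tail(n):
    """Sum of 1/M_j over n < j <= len(M), with n counted from 1.
    This is a truncation of alpha - alpha_n under the assumption above."""
    return sum(Fraction(1, M_j) for M_j in M[n:])

for n in range(1, len(M)):
    tail = truncated_tail(n)
    print(n, float(tail), float(Fraction(2, M[n])), float(M[n - 1] * tail))
```

Because every $m_j \ge 2$, the tail is dominated by a geometric series, which gives $\alpha - \alpha_n \le 2/M_{n+1}$ and hence $M_n(\alpha - \alpha_n) \le 2/m_{n+1}$; the truncated sums above stay below the same bounds.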

To estimate the second correction term in (61) note that

$$\varphi^n(x) + \psi^n(T_\alpha^{(\tau)}(x)) = \varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x)), \quad \text{for } x \in [0,1). \tag{63}$$

Indeed, $T_{\alpha_n}^{(\tau_n)}$ induces a permutation between the intervals $I_{k_1,\dots,k_n}$ and, by assertion (i) preceding
the formula (52), we have that $T_{\alpha_{n+j}}^{(\tau_{n+j})}$ maps the intervals $I_{k_1,\dots,k_n}$ onto the intervals
$T_{\alpha_n}^{(\tau_n)}(I_{k_1,\dots,k_n})$, for each $j \ge 0$. Noting that $\psi^n$ is constant on each of the intervals $I_{k_1,\dots,k_n}$
we obtain (63), by letting $j$ tend to infinity.

By (47), $\varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x))$ is the number of visits to $L_n$ minus the number of visits to
$R_n$ plus one, of the orbit $(T_{\alpha_n}^i(x))_{i=0}^{\tau_n(x)-1}$. Similarly, by (15), $h(x, T_\alpha^{(\tau)}(x))$ is the number of
visits to $L$ minus the number of visits to $R$ plus one, of the orbit $(T_\alpha^i(x))_{i=0}^{\tau(x)-1}$. We have to
show that the positive part of the difference

$$f_n(x) := [\varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x)) - h_+(x, T_\alpha^{(\tau)}(x))]_+, \quad x \in [0,1), \tag{64}$$

is small in $L^1$-norm, as $n \to \infty$. To do so, we argue separately on $I^n_{\mathrm{middle}} = [\tfrac12 - \tfrac{1}{2M_n}, \tfrac12 + \tfrac{1}{2M_n})$,
on the union of the "good" intervals at level $n$: $G_n = \bigcup_{(k_1,\dots,k_n) \in J_n^g} I_{k_1,\dots,k_n}$, and the union
of the "singular" intervals at level $n$, $S_n = \bigcup_{(k_1,\dots,k_n) \in J_n^s} I_{k_1,\dots,k_n}$.
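- For $x \in I^n_{\mathrm{middle}}$, the correction term $f_n(x)$ in (64) simply equals zero as $\tau_n(x) = \tau(x) = 0$.

- For $x \in S_n$, we have by (56) that $\varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x)) \le 1$ so that $f_n(x) \le 1$ too;
hence $\lim_{n\to\infty} \|f_n 1_{S_n}\|_{L^1(\mu)} = 0$.

- For $x \in G_n$, we use

$$f_n(x) \le [\varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x)) - h(x, T_\alpha^{(\tau)}(x))]_+ \le \sum_{k=n+1}^\infty \big[(\varphi^{k-1}(x) + \psi^{k-1}(T_{\alpha_{k-1}}^{(\tau_{k-1})}(x))) - (\varphi^k(x) + \psi^k(T_{\alpha_k}^{(\tau_k)}(x)))\big]_+$$

and (55) to conclude that

$$\lim_{n\to\infty} \|f_n 1_{G_n}\|_{L^1(\mu)} \le \lim_{n\to\infty} \sum_{k=n+1}^\infty \frac{c(M_1,\dots,M_{k-2})}{m_{k-1}} = 0.$$

This proves (62).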

Hence (17), (18) and (19) are satisfied. 

As regards assertion (16), let us verify that $\pi_0$ and $\pi_1$ are optimal transport plans.
Indeed, it follows from (17) and (18) that the dual value of the present transport problem
is greater than or equal to one, which implies that $\langle c, \pi_0\rangle = \langle c, \pi_1\rangle = 1$ is the optimal primal
value.

The fact that $\langle c, \pi_\tau\rangle > 1$ should be rather obvious to a reader who has made it up to
this point of the construction. It follows from rough estimates. The set {[0, \) (~l {r = 

— 1}} U {[^, 1) fl {r = 1}} has measure bigger than 1 — + J2°^ 2 c ^ Ml ' ?n ' M '~ 1 ^ , which is 

bigger than, say, |, for (m n )^L 1 tending sufficiently quick to infinity. As c(x, T&\x)) equals 
2 on this set we get 

(c,7r T ) > | > 1. 

A slightly more involved argument, whose verification is left to the energetic reader, shows
that, for $\varepsilon > 0$, we may choose $(m_n)_{n=1}^\infty$ such that

$$\langle h, \pi_\tau\rangle > 2 - \varepsilon. \tag{65}$$

Finally, we show assertion (iv) at the beginning of this section (see (20)). Let $\hat h \in L^1(\pi)^{**}$
be a dual optimizer in the sense of [BLS09, Theorem 4.2]. We know from this theorem that
there is a sequence $(\varphi_n, \psi_n)_{n=1}^\infty$ of bounded Borel functions$^5$ such that

($\alpha$) $\lim_{n\to\infty} \|[\varphi_n \oplus \psi_n - c]_+\|_{L^1(\pi)} = 0$, (66)

($\beta$) $\lim_{n\to\infty} \big( \int_X \varphi_n(x)\, d\mu(x) + \int_Y \psi_n(y)\, d\nu(y) \big) = 1$, (67)

($\gamma$) $\lim_{n\to\infty} \varphi_n \oplus \psi_n = \hat h^r$, $\pi$-a.s., (68)

($\delta$) $\hat h$ is a $\sigma(L^1(\pi)^{**}, L^\infty(\pi))$ cluster point of $(\varphi_n \oplus \psi_n)_{n=1}^\infty$. (69)

Here $\hat h = \hat h^r + \hat h^s$ is the decomposition of $\hat h \in L^1(\pi)^{**}$ into its regular part $\hat h^r \in L^1(\pi)$ and
into its purely singular part $\hat h^s \in L^1(\pi)^{**}$.



$^5$ The $(\varphi_n, \psi_n)$ need not be the same as the special sequence constructed above; still we find it convenient
to use the same notation.
We shall show that $\hat h^r$ equals $h$, $\pi$-almost surely. Indeed by assertions (66) and (67) above
we have that, for $x \in [0,1)$,

$$\lim_{n\to\infty} (\varphi_n(x) + \psi_n(x)) = c(x,x) = h(x,x) = 1,$$

and

$$\lim_{n\to\infty} (\varphi_n(x) + \psi_n(T_\alpha(x))) = c(x, T_\alpha(x)) = h(x, T_\alpha(x)) = \begin{cases} 2, & \text{for } x \in [0, \tfrac12), \\ 0, & \text{for } x \in [\tfrac12, 1), \end{cases}$$

the limit holding true in $L^1([0,1], \mu)$ as well as for $\mu$-a.e. $x \in [0,1)$, possibly after passing to
a subsequence. As in the discussion following [BLS09, Theorem 4.2] this implies that, for
each fixed $i \in \mathbb{Z}$,

$$\lim_{n\to\infty} (\varphi_n(x) + \psi_n(T_\alpha^i(x))) = h(x, T_\alpha^i(x)), \quad i \in \mathbb{Z},$$

the limit again holding true in $L^1(\mu)$ and $\mu$-a.s., after possibly passing to a diagonal
subsequence. Whence, we obtain with (68) that

$$\lim_{n\to\infty} (\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x))) = h(x, T_\alpha^{(\tau)}(x)) = \hat h^r(x, T_\alpha^{(\tau)}(x)),$$

convergence now holding true for $\mu$-a.e. $x \in [0,1]$.

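As $x \mapsto T_\alpha^{(\tau)}(x)$ is a measure preserving bijection we get

$$\int_{[0,1)} [\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x))]\, dx = \int_{[0,1)} (\varphi_n(x) + \psi_n(x))\, dx = 1,$$

so that, using (65) we get

$$\lim_{n\to\infty} \int [\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x))]\, 1_{\{\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x)) < h(x, T_\alpha^{(\tau)}(x))\}}(x)\, dx$$
$$= 1 - \lim_{n\to\infty} \int [\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x))]\, 1_{\{\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x)) \ge h(x, T_\alpha^{(\tau)}(x))\}}(x)\, dx$$
$$= 1 - \langle h, \pi_\tau\rangle < 0.$$

From $\lim_{n\to\infty} \mu\{x : \varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x)) < h(x, T_\alpha^{(\tau)}(x))\} = 0$ we conclude that each $\sigma^*$-cluster
point of $([\varphi_n(\cdot) + \psi_n(T_\alpha^{(\tau)}(\cdot))]_-)_{n=1}^\infty$ is a purely singular element of $L^1(\pi)^{**}$ of norm
equal to $\langle h, \pi_\tau\rangle - 1$.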

Finally, we still have to specify the prime numbers $(m_n)_{n=1}^\infty$ in the above induction.
It is now clear what we need: apart from satisfying the conditions of Lemma 3.1 as well
as the requirements whenever we wrote "for $m_n$ tending sufficiently fast to infinity", we
choose the (m n )'^L 1 inductively such that in (54) we have < 2-™, that in (55) we have 

c(Ml '-' M "- 2) < 2-" and in (57) we have -2- < \ as well as again c(Ml -"' M - 2) < 2"™. 

m n _i v ' mi 4 ° 

Hence we have shown all the assertions (i)-(iv) of Example 3.1 and the construction of 
the example is complete. □ 

4 A Relaxation of the Dual Problem 

As in [BLS09, Remark 3.4], for a given cost function $c : X \times Y \to [0, \infty]$, we consider the
family of pairs of functions

$$\Psi^{\mathrm{rel}}(\mu, \nu) = \left\{ (\varphi, \psi) : \begin{array}{l} \varphi, \psi \text{ Borel, integrable and } \varphi(x) + \psi(y) \le c(x,y),\ \pi\text{-a.s.}, \\ \text{for each finite transport plan } \pi \in \Pi(\mu, \nu, c) \end{array} \right\}$$

and define the relaxed value of the dual problem as

$$D^{\mathrm{rel}} = \sup\Big\{ \int_X \varphi\, d\mu + \int_Y \psi\, d\nu : (\varphi, \psi) \in \Psi^{\mathrm{rel}}(\mu, \nu) \Big\}. \tag{70}$$

Using the notation of [BLS09] it is obvious that $D \le D^{\mathrm{rel}}$ and it is straightforward to verify
that the trivial duality inequality $D^{\mathrm{rel}} \le P$ still is satisfied. One might conjecture - and the
present authors did so for some time - that $D^{\mathrm{rel}} = P$ holds true in full generality, i.e. for
arbitrary Borel measurable cost functions $c : X \times Y \to [0, \infty]$, defined on the product of two
Polish spaces $X$ and $Y$. In this section we construct a counterexample showing that this is
not the case, i.e. it may happen that we have a duality gap $P - D^{\mathrm{rel}} > 0$. The example will
be a variant of the example in the previous section, i.e. the $(n+1)$'th variation of [AP03,
Example 3.2].
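For orientation, the trivial duality inequality can be checked by hand in the finite setting of section 2, where both spaces are $\{1,\dots,N\}$ with uniform measure. The sketch below is an illustration under assumptions: the cost matrix `c` is hypothetical, the primal value is found by brute force over permutations (which suffices in the finite uniform case by the Birkhoff-von Neumann theorem), and dual candidates are produced from a chosen `psi` by taking the largest admissible `phi`.

```python
from itertools import permutations

INF = float("inf")

def primal_value(c):
    """Minimal average cost over all permutations (finite uniform setting)."""
    n = len(c)
    return min(sum(c[i][s[i]] for i in range(n)) for s in permutations(range(n))) / n

def dual_value_from_psi(c, psi):
    """Dual value of the pair (phi, psi), where phi(i) = min_j (c[i][j] - psi[j])
    is the largest phi satisfying phi(i) + psi(j) <= c[i][j] for all i, j."""
    n = len(c)
    phi = [min(c[i][j] - psi[j] for j in range(n)) for i in range(n)]
    return (sum(phi) + sum(psi)) / n

# Hypothetical cost matrix; INF marks pairs that may not be coupled.
c = [[1, 2, INF],
     [INF, 1, 0],
     [2, INF, 1]]

P = primal_value(c)
print(P, dual_value_from_psi(c, [0, 0, 0]), dual_value_from_psi(c, [0, 1, 0]))
```

Every admissible pair gives a dual value below the primal value, and in this finite example a suitable `psi` closes the gap. The point of the present section is that such a closing of the gap can fail, even for the relaxed dual problem, once the spaces are infinite and the cost is merely Borel measurable.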

In section 3 we constructed a measure preserving bijection $T_\alpha^{(\tau)} : [0,1) \to [0,1)$ having
certain properties; we now shall construct a sequence $(T_\alpha^{(\tau_n)})_{n=0}^\infty$ of such maps and consider
as cost function the restriction of $h_+$, where $h$ is defined in (15), to the graphs $(\Gamma_n)_{n=0}^\infty$ of
the maps $(T_\alpha^{(\tau_n)})_{n=0}^\infty$. This sequence also "builds up a singular mass", which now is positive
as opposed to the negative singular mass in the previous section, but it does so in a different
way. We summarize the properties of the maps which we shall construct in the following
proposition.

Proposition 4.1. With the notation of section 3 there is an irrational $\alpha \in [0,1)$ and a
sequence $(\tau_n)_{n=0}^\infty$ of maps $\tau_n : [0,1) \to \mathbb{Z}$, with $\tau_0 = 0$ and $\tau_1 = 1$, such that the
transformations $T_\alpha^{(\tau_n)} : [0,1) \to [0,1)$, defined by

$$T_\alpha^{(\tau_n)}(x) = T_\alpha^{\tau_n(x)}(x), \quad x \in [0,1),$$

have the following properties.

(i) Each $\tau_n$ is constant on a countable collection of disjoint, half open intervals in $[0,1)$
whose union has full measure. For $n \ge 0$, the map $T_\alpha^{(\tau_n)}$ defines a measure preserving
almost sure bijection of $([0,1), \mu)$ onto itself, where $\mu = \nu$ denotes Lebesgue measure
on $[0,1)$. We have, for each $n \ge 0$,

$$\int_{[0,1)} h(x, T_\alpha^{(\tau_n)}(x))\, dx = 1. \tag{71}$$

(ii) The function

$$f_n(x) := h(x, T_\alpha^{(\tau_n)}(x)), \quad x \in [0,1),$$

where $h$ is defined in (15), satisfies

$$\|f_n - g_n\|_{L^1(\mu)} < 2^{-n} \tag{72}$$

where $g_n$ is a Borel function on $[0,1)$ such that

$$\mu\{g_n = 0\} = 1 - \eta_n, \qquad \mu\{g_n = \tfrac{1}{\eta_n}\} = \eta_n \tag{73}$$

for some sequence $(\eta_n)_{n=1}^\infty$ tending to zero.

(iii) There is a sequence $(\varphi_n, \psi_n)_{n=1}^\infty$ of bounded Borel functions such that, for every fixed
$n \in \mathbb{N}$,

$$\lim_{m\to\infty} \big\| h(x, T_\alpha^{(\tau_n)}(x)) - [\varphi_m(x) + \psi_m(T_\alpha^{(\tau_n)}(x))] \big\|_{L^1(\mu)} = 0,$$

and

$$\lim_{n\to\infty} \Big[ \int_{[0,1)} \varphi_n(x)\, dx + \int_{[0,1)} \psi_n(y)\, dy \Big] = 1.$$

(iv) The sequence $(T_\alpha^{(\tau_n)})_{n=1}^\infty$ converges to the identity map in the following sense:

$$\delta(x, T_\alpha^{(\tau_n)}(x)) < 2^{-n}, \quad x \in [0,1),\ n \ge 1, \tag{74}$$

where $\delta(\cdot, \cdot)$ denotes the Riemannian metric on $\mathbb{T} = [0,1)$.

We postpone the proof of the proposition and first draw some consequences. Suppose
that $\alpha$ as well as $(T_\alpha^{(\tau_n)})_{n=0}^\infty$ have been defined and satisfy the assertions of Proposition 4.1.

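Proposition 4.2. Fix $M \ge 2$ and define the cost function $c_M : [0,1) \times [0,1) \to [0, \infty]$ by

$$c_M(x,y) = \begin{cases} h_+(x,y), & \text{for } (x,y) \text{ in the graph of } T^0, T_\alpha^1, T_\alpha^{(\tau_2)}, \dots, T_\alpha^{(\tau_M)}, \\ \infty, & \text{otherwise}. \end{cases}$$

For this cost function $c_M$ we find that the primal value, denoted by $P^M$, as well as the dual
value, denoted by $D^M$, of the Monge-Kantorovich problem both are equal to 1.
In addition, there is $\beta = \beta(M) > 0$, such that, for every partial transport

$$\sigma \in \Pi^{\mathrm{part}}(\mu, \nu) := \{\sigma \in \mathcal{M}_+(X \times Y) : p_X(\sigma) \le \mu,\ p_Y(\sigma) \le \nu\}$$

with

$$\|\sigma\| > \tfrac23 \quad \text{and} \quad \int_{X \times Y} c_M(x,y)\, d\sigma(x,y) \le \tfrac13,$$

there is no partial transport $\varrho \in \Pi^{\mathrm{part}}(\mu, \nu)$ with

$$\|\sigma + \varrho\| = 1 \quad \text{and} \quad \sigma + \varrho \in \Pi(\mu, \nu)$$

with the property that $\varrho$ is supported by

$$\Delta^\beta = \{(x,y) \in [0,1)^2 : \delta(x,y) < \beta\}.$$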

Proof. First note that there is an open and dense subset $G \subseteq [0,1)$ of full measure $\mu(G) = 1$
such that $c_M$, restricted to $G \times G$, is lower semi-continuous. This follows from assertion (i) of
Proposition 4.1 by replacing the half open intervals by their open interior. Noting that $G$ is
Polish we may apply the general duality theory [Kel84] to the cost function $c_M$ restricted to
$G \times G$ to conclude that there is no duality gap for the cost function $c_M|_{G \times G}$. It follows that
there is also no duality gap for the original setting of $c_M$, defined on $[0,1) \times [0,1)$, either.

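We claim that, for every $M \ge 0$, the value $D^M$ of the dual problem equals 1. Indeed, let
$(\varphi_n, \psi_n)_{n=1}^\infty$ be a sequence as in Proposition 4.1 (iii). Defining

$$\tilde\varphi_n(x) := \varphi_n(x) - \sum_{j=0}^M [\varphi_n(x) + \psi_n(T_\alpha^{(\tau_j)}(x)) - h(x, T_\alpha^{(\tau_j)}(x))]_+$$

and $\tilde\psi_n = \psi_n$, we have that

$$\tilde\varphi_n(x) + \tilde\psi_n(y) \le h(x,y) \le h_+(x,y),$$

for all $(x,y)$ in the graph of $T^0, T_\alpha^1, T_\alpha^{(\tau_2)}, \dots, T_\alpha^{(\tau_M)}$, and

$$\lim_{n\to\infty} \Big[ \int_X \tilde\varphi_n(x)\, dx + \int_Y \tilde\psi_n(y)\, dy \Big] = 1,$$

showing that $D^M \ge 1$. It follows that $D^M = P^M = 1$.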
Now suppose that the final assertion of the proposition is wrong. Then we find a sequence
$(\sigma_n)_{n=1}^\infty \in \Pi^{\mathrm{part}}(\mu, \nu)$ with $\|\sigma_n\| > \tfrac23$ and $\int_{X \times Y} c_M(x,y)\, d\sigma_n(x,y) \le \tfrac13$, as well as a sequence
$(\varrho_n)_{n=1}^\infty \in \Pi^{\mathrm{part}}(\mu, \nu)$ with $\|\sigma_n + \varrho_n\| = 1$ and $\sigma_n + \varrho_n \in \Pi(\mu, \nu)$ such that $\varrho_n$ is supported
by

$$\Delta^{1/n} = \{(x,y) \in [0,1)^2 : \delta(x,y) < \tfrac1n\}. \tag{75}$$

Considering $(\sigma_n)_{n=1}^\infty$ as measures on the product $G \times G$ of the Polish space $G$, we then can
find by Prokhorov's theorem a subsequence $(\sigma_{n_k})_{k=1}^\infty$ converging weakly on $G \times G$ to some
$\sigma \in \Pi^{\mathrm{part}}(\mu, \nu)$, for which we find $\|\sigma\| \ge \tfrac23$ and $\int_{X \times Y} c_M(x,y)\, d\sigma(x,y) \le \tfrac13$. By passing once
more to a subsequence, we may also suppose that $(\varrho_{n_k})_{k=1}^\infty$ weakly converges (as measures
on $G \times G$ or $[0,1) \times [0,1)$; here it does not matter) to some $\varrho \in \Pi^{\mathrm{part}}(\mu, \nu)$ for which we get
$\|\sigma + \varrho\| = 1$ and $\sigma + \varrho \in \Pi(\mu, \nu)$. By (75) we conclude that $\varrho$ induces the identity transport
from its marginal $p_X(\varrho)$ onto its marginal $p_Y(\varrho) = p_X(\varrho)$. As $c_M(x,x) = 1$, for $x \in [0,1)$, we
find that $\int_{X \times Y} c_M(x,y)\, d\varrho(x,y) = \|\varrho\| \le \tfrac13$, which implies that

$$\int_{X \times Y} c_M(x,y)\, d(\sigma + \varrho)(x,y) \le \tfrac13 + \tfrac13 = \tfrac23,$$

a contradiction to the fact that $P^M = 1$, which finishes the proof. □

We now can proceed to the construction of the example. 

Proposition 4.3. Assume the setting of Proposition 4.1. For a subsequence $(i_j)_{j=2}^\infty$ of
$\{2, 3, \dots\}$ we define the cost function $c : [0,1) \times [0,1) \to [0, \infty]$ by

$$c(x,y) = \begin{cases} h_+(x,y), & \text{for } (x,y) \text{ in the support of } T^0, T_\alpha^1, T_\alpha^{(\tau_{i_2})}, T_\alpha^{(\tau_{i_3})}, \dots, T_\alpha^{(\tau_{i_j})}, \dots \\ \infty, & \text{otherwise}. \end{cases} \tag{76}$$

If $(i_j)_{j=2}^\infty$ tends sufficiently fast to infinity we have that, for this cost function $c$, the primal
value $P$ is strictly positive, while the relaxed primal value $P^{\mathrm{rel}}$ (see [BLS09, Example 4.3])
as well as the dual value $D$ and the relaxed dual value $D^{\mathrm{rel}}$ (see (70)) all are equal to 0.

In particular there is a duality gap $P - D^{\mathrm{rel}} > 0$, disproving the conjecture mentioned at
the beginning of this section.

Proof. We proceed inductively: let $j \ge 2$ and suppose that $i_0 = 0, i_1 = 1, i_2, \dots, i_j$ have
been defined. Apply Proposition 4.2 to

$$c_j(x,y) = \begin{cases} h_+(x,y), & \text{for } (x,y) \text{ in the support of } T^0, T_\alpha^1, T_\alpha^{(\tau_{i_2})}, T_\alpha^{(\tau_{i_3})}, \dots, T_\alpha^{(\tau_{i_j})}, \\ \infty, & \text{otherwise}, \end{cases}$$

to find $\beta_j > 0$ satisfying the conclusion of Proposition 4.2. We may and do assume that
$\beta_j \le \min(\beta_1, \dots, \beta_{j-1})$. Now choose $i_{j+1}$ such that

$$\delta(x, T_\alpha^{(\tau_{i_{j+1}})}(x)) < \beta_j, \quad x \in [0,1). \tag{77}$$

This finishes the inductive step and well-defines the cost function $c(x,y)$ in (76).

By (71) each $T_\alpha^{(\tau_{i_j})}$ induces a Monge transport $\pi_{i_j} \in \Pi(\mu, \nu)$ which satisfies

$$\int_{X \times Y} h(x,y)\, d\pi_{i_j}(x,y) = \int_X h(x, T_\alpha^{(\tau_{i_j})}(x))\, dx = 1.$$

The fact that the relaxed primal value $P^{\mathrm{rel}}$ for the cost function $c$ equals zero directly
follows from the definition of $P^{\mathrm{rel}}$ [BLS09, Section 1.1], (72) and (73) by transporting the
measure $\mu 1_{\{g_n = 0\}}$, which has mass $1 - \eta_n$, via the Monge transport map $T_\alpha^{(\tau_n)}$, where $n$ is a
large element of the sequence $(i_j)_{j=1}^\infty$. Hence we conclude from [BLS09, Theorem 1.2] that
the dual value $D$ of the Monge-Kantorovich problem for the cost function $c$ defined in (76)
also equals zero.

Finally observe that we have $D = D^{\mathrm{rel}}$ in the present example: indeed, the set $\{(x,y) \in
[0,1)^2 : c(x,y) < \infty\}$ is the countable union of the supports of the finite cost Monge transport
plans $T^0, T_\alpha^{(\tau_{i_1})}, T_\alpha^{(\tau_{i_2})}, T_\alpha^{(\tau_{i_3})}, \dots$, so that the requirements $\varphi(x) + \psi(y) \le c(x,y)$,
for all $(x,y) \in [0,1)^2$, and $\varphi(x) + \psi(y) \le c(x,y)$, $\pi$-a.s., for each finite transport plan
$\pi \in \Pi(\mu, \nu)$, coincide (after possibly modifying $\varphi(x)$ on a $\mu$-null set).

What remains to prove is that the primal value $P$ satisfies $P > 0$. We shall show that,
for every transport plan $\pi \in \Pi(\mu, \nu)$, we have $\int_{X \times Y} c(x,y)\, d\pi(x,y) > \tfrac13$. Assume to the
contrary that there is $\pi \in \Pi(\mu, \nu)$ such that

$$\int_{X \times Y} c(x,y)\, d\pi(x,y) \le \tfrac13.$$

Denoting by $\sigma_j$ the restriction of $\pi$ to the union of the graphs of the maps $T^0, T_\alpha^1, T_\alpha^{(\tau_{i_2})},
\dots, T_\alpha^{(\tau_{i_j})}$, each $\sigma_j$ is a partial transport in $\Pi^{\mathrm{part}}(\mu, \nu)$ and the norms $(\|\sigma_j\|)_{j=1}^\infty$
increase to one. Choose $j$ such that

$$\|\sigma_j\| > \tfrac23.$$

We apply Proposition 4.2 to conclude that there is no partial transport plan $\varrho_j$ such that
$\sigma_j + \varrho_j \in \Pi(\mu, \nu)$, and such that $\varrho_j$ is supported by $\Delta^{\beta_j}$. But this is a contradiction as
$\varrho_j := \pi - \sigma_j$ has precisely these properties by (77). □

Proof of Proposition 4.1: The construction of the example described by Proposition 4.1 will
be an extension of the construction in the previous section from which we freely use the
notation.

We shall proceed by induction on $j \in \mathbb{N}$ and define a double-indexed family of maps
$\tau_{n,j} : [0,1) \to \mathbb{Z}$, where $1 \le n \le j$.

Step $j = 1$: Define

$$\tau_{1,1} : [0,1) \to \mathbb{Z}$$

as

$$\tau_{1,1} = -\tau_1,$$

where we have $m_1 = M_1$, $\alpha_1 = \frac{1}{M_1}$ and $\tau_1$ as in (28) above. At this stage the only difference
to the previous section is that we change the sign of $\tau_1$ as we now shall build up a "positive
singular mass", as opposed to the "negative singular mass" which we constructed in the
previous section. More precisely, defining $\varphi^1, \psi^1$ as in (27), we obtain, similarly as in (29),

$$\varphi^1(x) + \psi^1(T_{\alpha_1}^{(\tau_{1,1})}(x)) = \begin{cases} 0, & \text{for } x \in I_{k_1},\ k_1 \in \{2, \dots, (M_1-1)/2, (M_1+3)/2, \dots, M_1-1\}, \\ (M_1-1)/2, & \text{for } x \in I_{k_1},\ k_1 = 1, M_1, \\ 1, & \text{for } x \in I_{(M_1+1)/2}. \end{cases}$$

This finishes the inductive step for $j = 1$.

Step $j = 2$: Let $m_2$ and $M_2 = M_1 m_2$ be as in section 3, where $m_2$ satisfies the
requirements of Lemma 3.1, and still is free to be eventually specified. To define $\tau_{1,2} :
[0,1) \to \mathbb{Z}$ we want to make sure that the map $T_{\alpha_2}^{(\tau_{1,2})}$ maps the intervals $I_{k_1}$ bijectively onto
$T_{\alpha_1}^{(\tau_{1,1})}(I_{k_1})$. Using the notation of the previous section, we consider all the intervals $I_{k_1}$ as
"good" intervals so that we do not have to take extra care of some "singular" intervals.

More precisely, fix $1 \le k_1 \le M_1$, and write $\tau$ for $\tau_{1,1}|_{I_{k_1}}$. If $\tau > 0$, define $J^{k_1,c}$ as
$\{m_2 - \tau + 1, \dots, m_2\}$, i.e. the set of those indices $k_2$ such that the interval $I_{k_1,k_2}$ is not
mapped into $T_{\alpha_1}^{(\tau_{1,1})}(I_{k_1})$ under $T_{\alpha_2}^{(\tau_{1,1})}$. If $\tau < 0$, we define $J^{k_1,c}$ as $\{1, \dots, |\tau|\}$, and if $\tau = 0$,
we define $J^{k_1,c}$ as the empty set. The complement $\{1, \dots, m_2\} \setminus J^{k_1,c}$ is denoted by $J^{k_1,u}$.
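The case distinction for the index sets $J^{k_1,c}$ and $J^{k_1,u}$ is easy to spell out in code. The following sketch only transcribes the three cases stated above; the function names are hypothetical, and `tau` and `m` stand for the constant value $\tau$ on $I_{k_1}$ and for $m_2$.

```python
def changed_indices(tau, m):
    """J^{k,c}: indices k_2 in {1, ..., m} on which the map has to be redefined.
    tau > 0 gives the last tau indices, tau < 0 the first |tau| indices,
    tau = 0 the empty set."""
    if tau > 0:
        return set(range(m - tau + 1, m + 1))
    return set(range(1, abs(tau) + 1))

def unchanged_indices(tau, m):
    """J^{k,u}: the complement of J^{k,c} in {1, ..., m}."""
    return set(range(1, m + 1)) - changed_indices(tau, m)

print(changed_indices(2, 7), changed_indices(-2, 7), changed_indices(0, 7))
print(unchanged_indices(2, 7))
```

In each case the set $J^{k_1,c}$ has $|\tau|$ elements, so the proportion of sub-intervals on which $\tau_{1,2}$ differs from $\tau_{1,1}$ is $|\tau|/m_2$, which is small once $m_2$ is large.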

Define $\tau_{1,2} := \tau_{1,1} = \tau$ on the intervals $I_{k_1,k_2}$, for $k_2 \in J^{k_1,u}$. On the remaining intervals
$I_{k_1,k_2}$ with $k_2 \in J^{k_1,c}$ we define $\tau_{1,2}$ such that it takes constant values in $\{-M_2 + 1, \dots, M_2 - 1\}$
on each of these intervals, such that (37) (resp. (38)) is satisfied, and such that these
intervals $I_{k_1,k_2}$ are mapped onto the "remaining gaps" in $T_{\alpha_1}^{(\tau_{1,1})}(I_{k_1})$.

Using again Lemma 3.3 we summarize the properties of the thus constructed map $T_{\alpha_2}^{(\tau_{1,2})} :
[0,1) \to [0,1)$.

(i) The measure-preserving bijection $T_{\alpha_2}^{(\tau_{1,2})}$ maps each interval $I_{k_1}$ onto $T_{\alpha_1}^{(\tau_{1,1})}(I_{k_1})$. It
induces a permutation of the intervals $I_{k_1,k_2}$, where $1 \le k_1 \le M_1$, $1 \le k_2 \le m_2$.

(ii) Defining $\varphi^2, \psi^2$ as in (32) we get, for each $1 \le k_1 \le M_1$, similarly as in (39) and (40)
as well as

Ml r AM 2 
J2 / |(^ 1 (x)- V 1 (Ti^)( a; ))-(^( a: )-^(r^( a: ))|d a: < ! 

fe,=i J/fc i 



m 2 



(iii) On the middle interval $I^1_{\mathrm{middle}} = I_{(M_1+1)/2}$ we have $\tau_{1,2} = \tau_{1,1} = 0$.

We now pass to the construction of the map $\tau_{2,2} : [0,1) \to \mathbb{Z}$. We define, for each
$1 \le k_1 \le M_1$, and $x \in I_{k_1,k_2}$,

$$\tau_{2,2}(x) = \begin{cases} a_2(k_2), & \text{for } k_2 \in \{1, \dots, M_1\}, \\ -M_1, & \text{for } k_2 \in \{M_1 + 1, \dots, (m_2-1)/2\}, \\ 0, & \text{for } k_2 = (m_2+1)/2, \\ M_1, & \text{for } k_2 \in \{(m_2+3)/2, \dots, m_2 - M_1\}, \\ a_2(k_2), & \text{for } k_2 \in \{m_2 - M_1 + 1, \dots, m_2\}. \end{cases}$$

The definition of the function $a_2$ on the "singular" intervals $I_{k_1,k_2}$, where $k_2 \in \{1, \dots, M_1\} \cup
\{m_2 - M_1 + 1, \dots, m_2\}$, is done such that $T_{\alpha_2}^{(\tau_{2,2})}$ maps these intervals onto "remaining gaps"
$I_{k_1,l_2}$, where $l_2$ runs through the set

$$\{(m_2-1)/2 - M_1 + 1, \dots, (m_2-1)/2\} \cup \{(m_2+3)/2, \dots, (m_2+3)/2 + M_1 - 1\}$$

in the middle region of the interval $I_{k_1}$. As above we require in addition that $a_2$ on each
$I_{k_1,k_2}$ takes constant values in $\{-M_2 + 1, \dots, M_2 - 1\}$ and that (37) (resp. (38)) is satisfied.
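The bookkeeping of the "remaining gaps" can be checked on small numbers. The sketch below rests on a simplifying assumption that is not spelled out in the text: on the non-singular indices, the value $\pm M_1$ of $\tau_{2,2}$ is modelled as moving the sub-interval index $k_2$ by $\pm M_1$ inside $I_{k_1}$. The function names and the test values of `m` (an odd number standing for $m_2$) and `M_prev` (standing for $M_1$) are hypothetical.

```python
def index_shift(k, m, M_prev):
    """Modelled shift of the index k in {1, ..., m}; None on the singular
    indices {1, ..., M_prev} and {m - M_prev + 1, ..., m}, where a_2 is used."""
    middle = (m + 1) // 2
    if k <= M_prev or k > m - M_prev:
        return None
    if k < middle:
        return -M_prev
    if k == middle:
        return 0
    return M_prev

def remaining_gaps(m, M_prev):
    """Indices not hit by the images of the non-singular indices."""
    images = set()
    for k in range(1, m + 1):
        shift = index_shift(k, m, M_prev)
        if shift is not None:
            images.add(k + shift)
    return sorted(set(range(1, m + 1)) - images)

def stated_gaps(m, M_prev):
    """The middle region as written in the text."""
    left = range((m - 1) // 2 - M_prev + 1, (m - 1) // 2 + 1)
    right = range((m + 3) // 2, (m + 3) // 2 + M_prev)
    return sorted(set(left) | set(right))

print(remaining_gaps(11, 2), stated_gaps(11, 2))
```

Under this modelling assumption the gaps left free in the middle region are exactly the $2M_1$ indices listed above, which matches the number of singular indices that $a_2$ has to place there.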

The function $\tau_{2,2}$ mimics the construction of $\tau_{1,1}$ above, with the role of $[0,1)$ replaced
by each of the intervals $I_{k_1}$, for $1 \le k_1 \le M_1$. The idea is that, $T_{\alpha_1}^{M_1}$ being the identity map,
we have that $T_{\alpha_2}^{M_1}$ satisfies $T_{\alpha_2}^{M_1}(x) = x \oplus \frac{M_1}{M_2}$, and $\frac{M_1}{M_2} = \frac{1}{m_2}$ is small. Hence the role of $T_{\alpha_1}$
in the previous section now is taken by $T_{\alpha_2}^{M_1}$.

More precisely, we have, for each $k_1 = 1, \dots, M_1$, and $x \in I_{k_1,k_2}$,

$$\varphi^2(x) + \psi^2(T_{\alpha_2}^{(\tau_{2,2})}(x)) = \begin{cases} 0, & \text{for } k_2 \in \{M_1 + 1, \dots, (m_2-1)/2, (m_2+3)/2, \dots, m_2 - M_1\}, \\ \frac{m_2}{2M_1} + \gamma(M_1), & \text{for } k_2 \in \{1, \dots, M_1\} \cup \{m_2 - M_1 + 1, \dots, m_2\}, \\ 1, & \text{for } k_2 = (m_2+1)/2. \end{cases} \tag{78}$$

The notation $\gamma(M_1)$ denotes a quantity verifying $|\gamma(M_1)| \le c(M_1)$ for some constant $c(M_1)$,
depending only on $M_1$. The verification of (78) uses Lemma 3.3 and is analogous to that in
section 3.

As $T_{\alpha_2}^{(\tau_{2,2})}$ defines a measure preserving bijection on $[0,1)$, we get

$$\int_0^1 \big(\varphi^2(x) + \psi^2(T_{\alpha_2}^{(\tau_{2,2})}(x))\big)\, dx = \int_0^1 (\varphi^2(x) + \psi^2(x))\, dx = 1. \tag{79}$$

This finishes the inductive step for $j = 2$.

General inductive step: For prime numbers $m_1, \dots, m_{j-1}$ as in the previous section
suppose that we have defined, for $1 \le n \le j-1$, maps $\tau_{n,j-1} : [0,1) \to \mathbb{Z}$ such that the following
inductive hypotheses are satisfied.

(i) For $1 \le n \le j-1$, the measure preserving bijection $T_{\alpha_{j-1}}^{(\tau_{n,j-1})} : [0,1) \to [0,1)$ maps
the intervals $I_{k_1,\dots,k_{n-1}}$ onto themselves. It induces a permutation of the intervals
$I_{k_1,\dots,k_{j-1}}$, where $1 \le k_1 \le m_1, \dots, 1 \le k_{j-1} \le m_{j-1}$.



(ii) For $1 \le n < j-1$ we have, for $1 \le k_1 \le m_1, \dots, 1 \le k_{j-2} \le m_{j-2}$,

lAhu...*,-* n {r n>J -_2 ^ r nij _i}] < ^/i[/ fcl ,...,fc,_ 2 ], (80) 



and 



E / |(^"- 2 (x)-^- 2 (r(;--)(x))) 

- j-\Tfrj->\x))) \dx < c(M ^- 2) - 



We now shall define $\tau_{n,j} : [0,1) \to \mathbb{Z}$, for $1 \le n < j$, and $\tau_{j,j} : [0,1) \to \mathbb{Z}$.

Fix $1 \le n \le j-1$ as well as $1 \le k_1 \le m_1, \dots, 1 \le k_{j-1} \le m_{j-1}$. Denote by $\tau$
the constant value $\tau_{n,j-1}|_{I_{k_1,\dots,k_{j-1}}}$. If $\tau > 0$ define $J^{k_1,\dots,k_{j-1},c}$ as $\{m_j - \tau + 1, \dots, m_j\}$,
similarly as for the case $j = 2$ above. If $\tau \le 0$ define $J^{k_1,\dots,k_{j-1},c}$ as $\{1, \dots, |\tau|\}$ which, for
$\tau = 0$, equals the empty set. On the intervals $I_{k_1,\dots,k_{j-1},k_j}$ where $k_j$ lies in the complement
$J^{k_1,\dots,k_{j-1},u} = \{1, \dots, m_j\} \setminus J^{k_1,\dots,k_{j-1},c}$ we define $\tau_{n,j} := \tau_{n,j-1}$. On the remaining intervals
$I_{k_1,\dots,k_{j-1},k_j}$, where $k_j \in J^{k_1,\dots,k_{j-1},c}$, we define $\tau_{n,j}$ in such a way that it takes constant
values in $\{-M_j + 1, \dots, M_j - 1\}$ on each of these intervals, such that (37) (resp. (38)) is
satisfied, and such that these intervals $I_{k_1,\dots,k_{j-1},k_j}$ are mapped onto the "remaining gaps"
in $T_{\alpha_{j-1}}^{(\tau_{n,j-1})}(I_{k_1,\dots,k_{j-1}})$.

Similarly as in the previous section we thus well-define the function $\tau_{n,j}$ which then
verifies (80) and (81), with $j-1$ replaced by $j$.

We still have to define $\tau_{j,j} : [0,1) \to \mathbb{Z}$. For $1 \le k_1 \le m_1, \dots, 1 \le k_{j-1} \le m_{j-1}$, we
define $\tau_{j,j}(x)$ on the intervals $I_{k_1,\dots,k_{j-1},k_j}$ by

$$\tau_{j,j}(x) = \begin{cases} a_j(k_j), & \text{for } k_j \in \{1, \dots, M_{j-1}\}, \\ -M_{j-1}, & \text{for } k_j \in \{M_{j-1} + 1, \dots, (m_j-1)/2\}, \\ 0, & \text{for } k_j = (m_j+1)/2, \\ M_{j-1}, & \text{for } k_j \in \{(m_j+3)/2, \dots, m_j - M_{j-1}\}, \\ a_j(k_j), & \text{for } k_j \in \{m_j - M_{j-1} + 1, \dots, m_j\}. \end{cases}$$

Similarly as in step $j = 2$ the $\{-M_j + 1, \dots, M_j - 1\}$-valued function $a_j(k_j)$ is defined
in such a way that $T_{\alpha_j}^{(\tau_{j,j})}$ maps the intervals $I_{k_1,\dots,k_{j-1},k_j}$ with $k_j \in \{1, \dots, M_{j-1}\} \cup \{m_j -
M_{j-1} + 1, \dots, m_j\}$ to the intervals $I_{k_1,\dots,k_{j-1},l_j}$, where $l_j$ runs through the "middle region"

$$\{(m_j-1)/2 - M_{j-1} + 1, \dots, (m_j-1)/2\} \cup \{(m_j+3)/2, \dots, (m_j+3)/2 + M_{j-1} - 1\}.$$

We now deduce from Lemma 3.3 that, for $x \in I_{k_1,\dots,k_{j-1},k_j}$,

$$\varphi^j(x) + \psi^j(T_{\alpha_j}^{(\tau_{j,j})}(x)) = \begin{cases} 0, & \text{for } k_j \in \{M_{j-1} + 1, \dots, (m_j-1)/2\} \cup \{(m_j+3)/2, \dots, m_j - M_{j-1}\}, \\ \frac{m_j}{2M_{j-1}} + \gamma(M_1, \dots, M_{j-1}), & \text{for } k_j \in \{1, \dots, M_{j-1}\} \cup \{m_j - M_{j-1} + 1, \dots, m_j\}, \\ 1, & \text{for } k_j = (m_j+1)/2, \end{cases}$$

where $\gamma(M_1, \dots, M_{j-1})$ denotes a quantity which is bounded in absolute value by a constant
$c(M_1, \dots, M_{j-1})$ depending only on $M_1, \dots, M_{j-1}$.
This completes the inductive step.

We now define $\tau_0 = 0$, $\tau_1 = 1$ and, for $n \ge 2$,

$$\tau_n = \lim_{j\to\infty} \tau_{n-1,j}. \tag{82}$$

It follows from (80) that, for each $n \ge 2$, the limit (82) exists almost surely provided the
sequence $(m_n)_{n=1}^\infty$ converges sufficiently fast to infinity, similarly as in section 3 above. The
$(\tau_n)_{n=0}^\infty$ and the above constructed functions $(\varphi^n, \psi^n)_{n=1}^\infty$ satisfy the assertions of Proposition
4.1. The verification of items (i), (ii), and (iii) is analogous to the arguments of section 3 and
therefore skipped. As regards assertion (iv) note that, for $1 \le n \le j$, the function $T_{\alpha_j}^{(\tau_{n,j})}$
maps the intervals $I_{k_1,\dots,k_{n-1}}$ onto themselves. It follows that $T_\alpha^{(\tau_n)}$ does so too, whence

5{x,T^\x))<M-\ 

which readily shows (74). □ 
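The last step uses only the following elementary fact: a map that sends each interval of a partition of $[0,1)$ into $M$ equal pieces onto itself moves no point by $1/M$ or more in the metric $\delta$. The sketch below illustrates this with exact rationals; the map `rotate_within_cells` is a hypothetical stand-in (a half-turn inside each cell), not one of the maps $T_\alpha^{(\tau_n)}$, and `torus_distance` implements the usual distance on the circle $\mathbb{T} = [0,1)$.

```python
from fractions import Fraction
from math import floor

def torus_distance(x, y):
    """Distance between x and y on the circle [0, 1)."""
    d = abs(x - y) % 1
    return min(d, 1 - d)

def rotate_within_cells(x, M):
    """Hypothetical measure preserving map: inside each cell [k/M, (k+1)/M)
    rotate by half a cell length, so every cell is mapped onto itself."""
    cell = Fraction(1, M)
    k = floor(x * M)
    local = x - k * cell
    return k * cell + (local + cell / 2) % cell

M = 8
samples = [Fraction(i, 1000) for i in range(1000)]
largest = max(torus_distance(x, rotate_within_cells(x, M)) for x in samples)
print(largest, Fraction(1, M))
```

Here every sample point is moved by exactly half a cell length, well below $1/M$; the same reasoning, applied to the nested intervals preserved by the maps of Proposition 4.1, gives the convergence to the identity stated in (74).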